ElevenLabs Text-to-Speech
Convert text and documents to high-quality audio using the ElevenLabs TTS API, with single-narrator and dual-host podcast generation modes
Install
Quick install
npx skills add https://github.com/sanjay3290/ai-skills/tree/main/skills/elevenlabs
npx skills add sanjay3290/ai-skills --skill "ElevenLabs Text-to-Speech" --agent claude-code
npx skills add sanjay3290/ai-skills --skill "ElevenLabs Text-to-Speech" --agent cursor
npx skills add sanjay3290/ai-skills --skill "ElevenLabs Text-to-Speech" --agent codex
npx skills add sanjay3290/ai-skills --skill "ElevenLabs Text-to-Speech" --agent opencode
npx skills add sanjay3290/ai-skills --skill "ElevenLabs Text-to-Speech" --agent github-copilot
npx skills add sanjay3290/ai-skills --skill "ElevenLabs Text-to-Speech" --agent windsurf
More install options
Shorthand (useful for multi-skill repos):
npx skills add sanjay3290/ai-skills --skill "ElevenLabs Text-to-Speech"
Manual (clone the repo and copy the folder into your agent's skills directory):
git clone https://github.com/sanjay3290/ai-skills.git
cp -r ai-skills/skills/elevenlabs ~/.claude/skills/
What is it?
This skill converts text and documents into high-quality audio using the ElevenLabs TTS API. It supports two modes: single-voice narration and two-host conversational podcast generation.
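As a rough illustration of what a single narration call involves, the sketch below builds (but does not send) one HTTP request. It assumes the public ElevenLabs REST endpoint `POST /v1/text-to-speech/{voice_id}` with an `xi-api-key` header. The skill's own scripts are not shown on this page and may work differently, and `build_tts_request` is a hypothetical helper name. The voice ID and model name are the sample values from this skill's config.

```python
import json
import urllib.request

# Assumption: the public ElevenLabs REST endpoint, not necessarily what the skill calls.
API_BASE = "https://api.elevenlabs.io/v1/text-to-speech"


def build_tts_request(text, api_key, voice_id, model_id):
    """Build (but do not send) a POST request for one block of text."""
    body = json.dumps({"text": text, "model_id": model_id}).encode("utf-8")
    return urllib.request.Request(
        url=f"{API_BASE}/{voice_id}",
        data=body,
        headers={"xi-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


request = build_tts_request(
    text="Hello from the narrator.",
    api_key="your-elevenlabs-api-key",  # placeholder, not a real key
    voice_id="JBFqnCBsd6RMkjVDRZzb",
    model_id="eleven_multilingual_v2",
)
print(request.get_method(), request.full_url)

# Sending it would look like this; the response body is the audio data:
# with urllib.request.urlopen(request) as response:
#     audio_bytes = response.read()
```

In a two-host podcast, the same request would presumably be built once per line of dialogue, alternating between the two configured voice IDs.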
How to use it?
Config at skills/elevenlabs/config.json:
```
{
"api_key": "your-elevenlabs-api-key",
"default_voice": "JBFqnCBsd6RMkjVDRZzb",
"default_model": "eleven_multilingual_v2",
"podcast_voice1": "JBFqnCBsd6RMkjVDRZzb",
"podcast_voice2": "EXAVITQu4vr4xnSDxMaL"
}
```
Only api_key is required. Alternatively, leave it out of the config and set the ELEVENLABS_API_KEY environment variable.
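The key lookup described above could be sketched as follows. This is a minimal illustration, not the skill's actual code: `resolve_api_key` is a hypothetical name, and the precedence (config file first, environment variable second) is an assumption, since the page does not say which source wins when both are set.

```python
import json
import os
from pathlib import Path


def resolve_api_key(config_path, environ=os.environ):
    """Return the API key from config.json, else from ELEVENLABS_API_KEY."""
    path = Path(config_path)
    # A missing config file is treated like an empty config.
    config = json.loads(path.read_text(encoding="utf-8")) if path.is_file() else {}
    # Assumption: the config value takes precedence over the environment.
    api_key = config.get("api_key") or environ.get("ELEVENLABS_API_KEY")
    if not api_key:
        raise RuntimeError(
            "No API key: set api_key in config.json or ELEVENLABS_API_KEY"
        )
    return api_key


# Demo with a config path that does not exist and a stand-in environment.
print(resolve_api_key("missing-config.json", {"ELEVENLABS_API_KEY": "demo-key"}))
```

Passing the environment as a parameter keeps the function easy to test without touching the real process environment.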
Dependencies: pip install PyPDF2 python-docx (only needed for PDF/DOCX files).
Requires ffmpeg for multi-chunk narration and podcasts.
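To show why ffmpeg comes into play for long inputs, here is a hedged sketch of multi-chunk narration: split the text into sentence-aligned chunks, synthesize each chunk to its own audio file, then join the files with ffmpeg's concat demuxer. The 2500-character limit, the function names, and the file names are all assumptions for illustration, and the skill may chunk and merge differently.

```python
import re


def split_into_chunks(text, max_chars=2500):
    """Group whole sentences into chunks of at most max_chars characters.

    A single sentence longer than max_chars is kept as one oversized chunk.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def concat_list(paths):
    """Contents of the list file read by ffmpeg's concat demuxer.

    Assumes the paths contain no single quotes.
    """
    return "".join(f"file '{path}'\n" for path in paths)


def concat_command(list_file, output_file):
    """ffmpeg arguments that join the listed files without re-encoding."""
    return ["ffmpeg", "-y", "-f", "concat", "-safe", "0",
            "-i", list_file, "-c", "copy", output_file]


chunks = split_into_chunks("One. Two. Three.", max_chars=9)
part_files = [f"part_{index:03d}.mp3" for index in range(len(chunks))]
print(chunks)
print(concat_list(part_files), end="")
print(" ".join(concat_command("parts.txt", "narration.mp3")))
```

Splitting on sentence boundaries avoids cutting a word or phrase in half, which would otherwise be audible at the join between two chunks.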
Key Features
- Convert text and documents to high-quality audio using the ElevenLabs TTS API, with single-narrator and dual-host podcast generation modes
- Seamless integration with Claude's development workflow
- Comprehensive guidelines and best practices for ElevenLabs text-to-speech
GitHub Stats
Author: sanjay3290
License: Apache-2.0
Version: 1.0.0
Categories: Creative, Productivity
Tags: text-to-speech, elevenlabs, podcast, audio, tts
---
Source: https://github.com/sanjay3290/ai-skills/tree/main/skills/elevenlabs
Author: sanjay3290
License: https://opensource.org/licenses/Apache-2.0
GitHub Stars: 79
Tags: text-to-speech, elevenlabs, podcast, audio, tts
SKILL.md source
---
name: ElevenLabs Text-to-Speech
description: Convert text and documents to high-quality audio using ElevenLabs TTS API, supporting single narrator and dual-host podcast generation modes
---
# ElevenLabs Text-to-Speech
Convert text and documents to high-quality audio using the ElevenLabs TTS API, with single-narrator and dual-host podcast generation modes
## What is it?
This skill converts text and documents into high-quality audio using the ElevenLabs TTS API. It supports two modes: single-voice narration and two-host conversational podcast generation.
## How to use it?
Config at `skills/elevenlabs/config.json`:
```
{
"api_key": "your-elevenlabs-api-key",
"default_voice": "JBFqnCBsd6RMkjVDRZzb",
"default_model": "eleven_multilingual_v2",
"podcast_voice1": "JBFqnCBsd6RMkjVDRZzb",
"podcast_voice2": "EXAVITQu4vr4xnSDxMaL"
}
```
Only `api_key` is required. Alternatively, leave it out of the config and set the `ELEVENLABS_API_KEY` environment variable.
Dependencies: `pip install PyPDF2 python-docx` (only needed for PDF/DOCX files).
Requires `ffmpeg` for multi-chunk narration and podcasts.
## Key Features
* Convert text and documents to high-quality audio using the ElevenLabs TTS API, with single-narrator and dual-host podcast generation modes
* Seamless integration with Claude's development workflow
* Comprehensive guidelines and best practices for ElevenLabs text-to-speech
### GitHub Stats
* Author: sanjay3290
* License: Apache-2.0
* Version: 1.0.0
### Categories
Creative, Productivity
### Tags
text-to-speech, elevenlabs, podcast, audio, tts