
Google Cloud Text To Speech

Convert text and documents to audio using Google Cloud TTS API with Neural2/WaveNet/Studio voices, 40+ languages, and dual-host podcast generation

Version: 1.0.0
License: Apache-2.0
Token count: ~529
Updated: Jun 5, 2026

Install

Quick install

via npx skills · works with 57+ agents
npx skills add https://github.com/sanjay3290/ai-skills/tree/main/skills/google-tts
Or pick agent:
npx skills add sanjay3290/ai-skills --skill "Google Cloud Text-to-Speech" --agent claude-code
npx skills add sanjay3290/ai-skills --skill "Google Cloud Text-to-Speech" --agent cursor
npx skills add sanjay3290/ai-skills --skill "Google Cloud Text-to-Speech" --agent codex
npx skills add sanjay3290/ai-skills --skill "Google Cloud Text-to-Speech" --agent opencode
npx skills add sanjay3290/ai-skills --skill "Google Cloud Text-to-Speech" --agent github-copilot
npx skills add sanjay3290/ai-skills --skill "Google Cloud Text-to-Speech" --agent windsurf
More install options

Shorthand — useful for multi-skill repos:

npx skills add sanjay3290/ai-skills --skill "Google Cloud Text-to-Speech"

Manual — clone the repo and drop the folder into your agent's skills directory:

git clone https://github.com/sanjay3290/ai-skills.git
cp -r ai-skills/skills/google-tts ~/.claude/skills/
How to use: Once installed, ask your agent to "use the Google Cloud Text-to-Speech skill" or describe what you want (e.g. "Convert text and documents to audio using Google Cloud TTS API with Neural2/Wave"). Requires Node.js 18+.

What is it?
A skill that converts text and documents to audio using the Google Cloud TTS API, with Neural2, WaveNet, and Studio voices, 40+ languages, and dual-host podcast generation. Built for use cases involving text-to-speech, google-cloud, tts, audio, and podcast.
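
As a rough illustration of what a conversion involves, the sketch below builds the JSON body for the public Cloud Text-to-Speech v1 `text:synthesize` REST method and decodes the base64 audio in its response. The helper names (`build_synthesize_request`, `decode_audio`) and the default voice `en-US-Neural2-A` are assumptions made for this example; the skill's own code may be structured differently.

```python
import base64
import json

# Public REST endpoint for Cloud Text-to-Speech v1.
SYNTHESIZE_URL = "https://texttospeech.googleapis.com/v1/text:synthesize"


def build_synthesize_request(text, voice_name="en-US-Neural2-A", language_code="en-US"):
    """Return the JSON body for a text:synthesize call (hypothetical helper)."""
    return {
        "input": {"text": text},
        # The voice name is an assumed default; any Neural2/WaveNet/Studio voice fits here.
        "voice": {"languageCode": language_code, "name": voice_name},
        "audioConfig": {"audioEncoding": "MP3"},
    }


def decode_audio(response_body):
    """Extract raw audio bytes from a text:synthesize JSON response string."""
    return base64.b64decode(json.loads(response_body)["audioContent"])
```

The body would be sent as a POST to `SYNTHESIZE_URL` with the API key attached, and the decoded bytes written to an `.mp3` file. The network call is left out so the sketch runs without credentials.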

How to use it?

Provide an API key either through the GOOGLE_TTS_API_KEY environment variable or in skills/google-tts/config.json as {"api_key": "..."}. ffmpeg is required for multi-chunk documents. Optionally, run pip install PyPDF2 python-docx to enable PDF and DOCX input.
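
The two key sources described above could be resolved along these lines. This is a minimal sketch: the function name `load_api_key` is hypothetical, and checking the environment variable before the config file is an assumption, since the listing does not state which source takes precedence.

```python
import json
import os
from pathlib import Path


def load_api_key(config_path="skills/google-tts/config.json", env=None):
    """Return the API key from the environment or, failing that, the config file."""
    env = os.environ if env is None else env
    # Assumed precedence: the environment variable wins over config.json.
    key = env.get("GOOGLE_TTS_API_KEY")
    if key:
        return key
    path = Path(config_path)
    if path.is_file():
        key = json.loads(path.read_text(encoding="utf-8")).get("api_key")
        if key:
            return key
    raise RuntimeError(
        "No API key found: set GOOGLE_TTS_API_KEY or add api_key to config.json"
    )
```

Passing `env` explicitly is only there to make the function easy to test; in normal use it falls back to `os.environ`.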

Key Features

  • Convert text and documents to audio using Google Cloud TTS API with Neural2/WaveNet/Studio voices, 40+ languages, and dual-host podcast generation
  • Seamless integration with Claude's development workflow
  • Comprehensive guidelines and best practices for Google Cloud Text-to-Speech

GitHub Stats

Author: sanjay3290 · License: Apache-2.0 · Version: 1.0.0

Categories

Creative, Productivity

Tags

text-to-speech, google-cloud, tts, audio, podcast

Related Skills

More from Creative

ElevenLabs Text-to-Speech

Convert text and documents to high-quality audio using ElevenLabs TTS API, supporting single narrator and dual-host podcast generation modes

79 · sanjay3290 · Creative, Productivity

Doc Co-Authoring Workflow

Structured workflow for collaboratively writing documentation, proposals, and technical specs with Claude through three stages: context gathering, iterative refinement, and reader testing

5.7k · anthropics · Productivity, Creative

Tailored Resume Generator

Analyzes job descriptions and generates tailored resumes with ATS optimization, covering career transitions, senior executives, and recent graduates

3.2k · ComposioHQ · Productivity, Creative

---

Source: https://github.com/sanjay3290/ai-skills/tree/main/skills/google-tts
Author: sanjay3290
License: https://opensource.org/licenses/Apache-2.0
GitHub Stars: 79
Tags: text-to-speech, google-cloud, tts, audio, podcast

SKILL.md source

---
name: Google Cloud Text-to-Speech
description: Convert text and documents to audio using Google Cloud TTS API with Neural2/WaveNet/Studio voices, 40+ languages, and dual-host podcast generation
---

# Google Cloud Text-to-Speech

Convert text and documents to audio using Google Cloud TTS API with Neural2/WaveNet/Studio voices, 40+ languages, and dual-host podcast generation

## What is it?
A skill that converts text and documents to audio using the Google Cloud TTS API, with Neural2, WaveNet, and Studio voices, 40+ languages, and dual-host podcast generation. Built for use cases involving text-to-speech, google-cloud, tts, audio, and podcast.

## How to use it?
API key via `GOOGLE_TTS_API_KEY` env var or `skills/google-tts/config.json` with `{"api_key": "..."}`.
Requires `ffmpeg` for multi-chunk documents. Optional: `pip install PyPDF2 python-docx` for PDF/DOCX.
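
The ffmpeg requirement suggests that long documents are synthesized in several pieces and the audio is joined afterwards. A minimal sketch of the splitting step follows, assuming a per-request budget of 4500 UTF-8 bytes (chosen to stay under the 5000-byte input limit the API documents) and sentence-boundary splitting. The function name `chunk_text` and the splitting strategy are illustrative, not taken from the skill.

```python
import re


def chunk_text(text, max_bytes=4500):
    """Split text into chunks whose UTF-8 size stays within max_bytes."""
    # Break after sentence-ending punctuation followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if len(candidate.encode("utf-8")) <= max_bytes:
            current = candidate
            continue
        if current:
            chunks.append(current)
        # The sentence did not fit: start over and add it word by word,
        # flushing whenever the next word would exceed the budget.
        # A single word longer than max_bytes is kept whole.
        current = ""
        for word in sentence.split():
            candidate = f"{current} {word}".strip()
            if current and len(candidate.encode("utf-8")) > max_bytes:
                chunks.append(current)
                current = word
            else:
                current = candidate
    if current:
        chunks.append(current)
    return chunks
```

Each chunk would then be synthesized separately, and the resulting audio files concatenated with ffmpeg into one output file.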

## Key Features

* Convert text and documents to audio using Google Cloud TTS API with Neural2/WaveNet/Studio voices, 40+ languages, and dual-host podcast generation
* Seamless integration with Claude's development workflow
* Comprehensive guidelines and best practices for Google Cloud Text-to-Speech

### GitHub Stats
Author: sanjay3290 · License: Apache-2.0 · Version: 1.0.0

### Categories
Creative, Productivity

### Tags
text-to-speech, google-cloud, tts, audio, podcast

