Nemesys Labs Text-to-Speech Platform is a free AI-powered speech synthesis service that converts written text into natural-sounding speech via a simple HTTP API. Designed for quick integration, it supports multiple voices and languages, enabling developers to generate audio on demand with a single request. The platform emphasizes ease of use, rapid results, and broad language coverage for applications such as podcasts, voice assistants, e-learning, and accessibility tools.
How to Use Nemesys Labs Text-to-Speech
- Prepare your request: provide the text you want to convert and optionally select a voice.
- Send a POST request to the API endpoint:
https://api.nemesyslabs.com/api/v1/text-to-speech with a JSON body.
- Receive audio: the API returns an MP3 audio file containing the spoken text. You can then save or stream the audio in your application.
Example Request
curl --location 'https://api.nemesyslabs.com/api/v1/text-to-speech' \
--header 'Content-Type: application/json' \
--header 'Authorization: Bearer ••••••' \
--data '{
"text": "Welcome to Nemesys Labs, we try our best to democratize state-of-the-art speech synthesis like this one, have fun!",
"voiceId": "Alice"
}'
Response: 200 with an MP3 audio file.
Voices and Languages
- Supports over 21 languages with multiple voices, including: Eva, Rafael, Matthew, Alice, Alex, Michael, Jack, and more.
Endpoints and Parameters
- Endpoint:
POST /api/v1/text-to-speech
- Required:
text (up to 50,000 characters, varies by plan)
- Optional:
voiceId (defaults to Alice)
- Response: MP3 audio file
Voice Options
- Available voices: Eva, Rafael, Matthew, Alice, Alex, Michael, Jack, Ava, Carmen, Emily, Elena, Emma, and others.
Use Cases
- Podcast narration, voice-overs, accessibility features, e-learning narration, dynamic content generation, and more.
Safety and Usage Considerations
- Ensure you have appropriate rights to the text being converted and comply with applicable laws and platform terms.
Core Features
- Quick HTTP API access with a single POST request
- Support for 21+ languages and multiple voices
- High-quality, natural-sounding speech synthesis
- Simple JSON-based request payloads
- MP3 audio output ready for integration
- Minimal setup with no card required for basic usage