new
Release
New Features & Major Enhancements
- Comprehensive Onboarding System:We've built a complete onboarding experience that guides new users through their first narration project. The multi-stage flow includes voice source selection (clone, public voices, or celebrity voices), file upload with SRT support, and project creation. Users can now choose from existing cloned voices or record new ones directly in the onboarding flow, with smart voice recommendations based on their use case.
- Timeline Audio Editor:Introduced a powerful timeline editor for narration projects with visual waveform display, drag-and-drop paragraph reordering, and precise gap adjustments between segments. The editor features a transport bar with playback controls, zoom functionality (keyboard shortcuts supported), auto-scroll during playback, and a ruler for precise timing. Users can now see their entire narration project laid out visually and make fine-tuned adjustments.
- Smart Tags for Natural Speech:Added emotion tags to give users fine control over narration delivery. The system intelligently parses and serializes these tags, ensuring consistent formatting across different voice providers. Emotion tags let you add expressiveness to specific text segments.
- Export System with Configuration:Implemented a complete export pipeline with configurable options including subtitle generation (SRT format). Users can track export history, view latest export status, and download completed exports. The system supports unified concatenation and export paths with proper silence insertion between paragraphs.
- Voice Discovery by Category:Launched voice categories (formerly "occasions") that let users browse voices by context like Super Bowl, romantic, scary/creepy, and radio announcer styles. The system includes intelligent voice categorization and analysis, making it easier to find the perfect voice for any project.
User Experience & Interface Polish
- Refined Paragraph Toolbar:Redesigned the paragraph toolbar with smart tag insertion buttons, regeneration controls, and improved visual feedback. The toolbar now shows generation status with breathing animations and disables actions appropriately during processing.
- Enhanced Voice Selection:Improved voice selection across the platform with better filtering by gender, language, and use case. Added hover-to-play functionality on voice avatars, expanded language options, and integrated popular voice lists in the onboarding flow.
- Better Audio Playback:Enhanced the shared audio player with proper cleanup logic, paywall triggers, and 'ended' event handling. Implemented auto-pause during gap adjustments and improved seek functionality in the timeline editor.
- Zoom-to-Fit Functionality:Added zoom controls in the timeline editor with keyboard shortcuts, allowing users to quickly adjust their view to see the entire project or focus on specific segments.
Stability & Reliability
- Performance Optimizations:Implemented smart caching for narration previews and paragraph history, reducing generation times and costs. Improved voice loading performance with better state management using centralized stores. Enhanced scrolling behavior with custom scroll containers.
- Robust Error Handling:Added comprehensive error codes for voice limits, and public voice downloads. Improved error messages to be more actionable and user-friendly.