TTS/STT Dataset Builder

Create high-quality speech datasets for Text-to-Speech and Speech-to-Text models

A collaborative platform for researchers to build and manage speech datasets. Upload documents, videos, and create transcription datasets for Romanian language models.

Features:

  • 📄 PDF & Text document processing
  • 🎥 Video transcription with Whisper
  • 🤖 Vision AI for complex documents
  • 📊 Dataset management & export