An AI-powered audio-to-text transcription web application built with a React frontend and a Spring Boot backend. Users upload audio files and receive text transcriptions produced by AI speech recognition APIs.
- Upload audio files for transcription
- Convert speech to text using AI
- Modern, clean, responsive UI
- Full-stack architecture (React + Spring Boot)
- Loading indicator while processing audio
- REST API integration
Client (React)
│
▼
Spring Boot REST API
│
▼
AI Speech-to-Text API
│
▼
Transcription Result
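The first hop in the flow above (client to Spring Boot REST API) can be sketched with the JDK's built-in HTTP client standing in for the React/Axios call. This is only an illustration under assumptions: the endpoint path `/api/transcribe` and the form field name `file` are hypothetical, so check TranscriptionController.java for the real values.

```java
import java.io.ByteArrayOutputStream;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.UUID;

final class TranscriptionClient {

    // Hypothetical endpoint and form field name (not confirmed by this README).
    static final String ENDPOINT = "http://localhost:8080/api/transcribe";
    static final String FIELD_NAME = "file";

    /** Encodes a single file as a multipart/form-data body. */
    static byte[] multipartBody(String boundary, String fileName, byte[] content) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        String header = "--" + boundary + "\r\n"
                + "Content-Disposition: form-data; name=\"" + FIELD_NAME
                + "\"; filename=\"" + fileName + "\"\r\n"
                + "Content-Type: application/octet-stream\r\n\r\n";
        out.writeBytes(header.getBytes(StandardCharsets.UTF_8));
        out.writeBytes(content);
        // The closing delimiter ends the multipart body.
        out.writeBytes(("\r\n--" + boundary + "--\r\n").getBytes(StandardCharsets.UTF_8));
        return out.toByteArray();
    }

    /** Builds the POST request that uploads the audio bytes to the backend. */
    static HttpRequest buildRequest(String fileName, byte[] content) {
        String boundary = "----transcriber-" + UUID.randomUUID();
        return HttpRequest.newBuilder(URI.create(ENDPOINT))
                .header("Content-Type", "multipart/form-data; boundary=" + boundary)
                .POST(HttpRequest.BodyPublishers.ofByteArray(
                        multipartBody(boundary, fileName, content)))
                .build();
    }

    public static void main(String[] args) throws Exception {
        // Usage: java TranscriptionClient.java path/to/audio-file
        Path audio = Path.of(args[0]);
        HttpRequest request = buildRequest(
                audio.getFileName().toString(), Files.readAllBytes(audio));
        HttpResponse<String> response = HttpClient.newHttpClient()
                .send(request, HttpResponse.BodyHandlers.ofString());
        System.out.println(response.statusCode() + " " + response.body());
    }
}
```

With the backend running, something like `java TranscriptionClient.java sample.wav` should print the HTTP status and the response body, provided the assumed path and field name match the controller.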
- React
- Axios
- CSS
- Vite
- Spring Boot
- Java
- REST API
- Maven
- Speech-to-Text API (Whisper / Groq / OpenAI)
Directory structure:
└── ai-audio-transcriber/
    ├── README.md
    ├── Audio-Transcriber/
    │   ├── mvnw
    │   ├── mvnw.cmd
    │   ├── pom.xml
    │   ├── src/
    │   │   ├── main/
    │   │   │   ├── java/
    │   │   │   │   └── com/
    │   │   │   │       └── audioTranscriber/
    │   │   │   │           ├── Application.java
    │   │   │   │           ├── TranscriptionController.java
    │   │   │   │           └── WebConfig.java
    │   │   │   └── resources/
    │   │   │       └── application.properties
    │   │   └── test/
    │   │       └── java/
    │   │           └── com/
    │   │               └── audioTranscriber/
    │   │                   └── ApplicationTests.java
    │   └── .mvn/
    │       └── wrapper/
    │           └── maven-wrapper.properties
    └── audio-transcriber-frontend/
        ├── README.md
        ├── eslint.config.js
        ├── index.html
        ├── package.json
        ├── vite.config.js
        └── src/
            ├── App.css
            ├── App.jsx
            ├── AudioUploader.jsx
            ├── index.css
            └── main.jsx
git clone https://github.com/himanshux19/ai-audio-transcriber.git
cd ai-audio-transcriber
Navigate to backend folder:
cd Audio-Transcriber
Run the backend server:
./mvnw spring-boot:run
Backend runs on:
http://localhost:8080
In a second terminal, navigate to the frontend folder from the repository root:
cd audio-transcriber-frontend
Install dependencies:
npm install
Run the frontend:
npm run dev
Frontend runs on:
http://localhost:5173
API keys are not included in this repository for security reasons.
Create a .env file and add one of the following:
OPENAI_API_KEY=your_api_key_here
or
GROQ_API_KEY=your_api_key_here
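How the backend picks up the key is not shown in this README. A stock Spring Boot application does not load .env files automatically, so the sketch below assumes the variables have been exported into the process environment. The class name `ApiKeyResolver` and the preference for `OPENAI_API_KEY` over `GROQ_API_KEY` are illustrative assumptions, not the project's actual code.

```java
import java.util.Map;
import java.util.Optional;

final class ApiKeyResolver {

    /**
     * Returns the first non-blank key found.
     * Checking OPENAI_API_KEY before GROQ_API_KEY is an assumed order.
     */
    static Optional<String> resolve(Map<String, String> env) {
        for (String name : new String[] {"OPENAI_API_KEY", "GROQ_API_KEY"}) {
            String value = env.get(name);
            if (value != null && !value.isBlank()) {
                return Optional.of(value.trim());
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        // Fail fast with a clear message when neither variable is set.
        String key = resolve(System.getenv())
                .orElseThrow(() -> new IllegalStateException(
                        "Set OPENAI_API_KEY or GROQ_API_KEY before starting the backend"));
        // Never print the key itself; report only that one was found.
        System.out.println("API key loaded (" + key.length() + " characters)");
    }
}
```

Failing at startup when no key is present tends to be easier to diagnose than an authentication error from the speech-to-text provider on the first upload.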
Upload an audio file → the AI processes the speech → the text transcription appears once processing completes.
- Real-time microphone transcription
- Download transcript feature
- Copy-to-clipboard button
- Multi-language support
- Drag-and-drop audio upload
Himanshu Singh