🎧 AI Audio Transcriber

An AI-powered web application that converts speech into text, built with a React frontend and a Spring Boot backend. Users upload audio files and receive text transcriptions produced by an AI speech recognition API.

🚀 Features

Upload audio files for transcription
Convert speech to text using AI
Clean, responsive user interface
Full-stack architecture (React + Spring Boot)
Loading indicator while processing audio
REST API integration

🏗️ Project Architecture

Client (React)
      │
      ▼
Spring Boot REST API
      │
      ▼
AI Speech-to-Text API
      │
      ▼
Transcription Result
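
The flow in the diagram can be sketched in plain Java. This is only an illustration under assumptions: `SpeechToTextClient` and `TranscriptionService` are hypothetical names, not classes from this repository, and the AI call is replaced by a stub so the sketch runs without a network connection or an API key.

```java
// Hypothetical stand-in for the external AI speech-to-text call.
// In a real backend this would be an HTTP request to the chosen provider.
interface SpeechToTextClient {
    String transcribe(byte[] audio, String filename);
}

// Hypothetical backend-side service: rejects empty uploads,
// delegates to the AI client, and returns the trimmed transcript.
final class TranscriptionService {
    private final SpeechToTextClient client;

    TranscriptionService(SpeechToTextClient client) {
        this.client = client;
    }

    String handleUpload(byte[] audio, String filename) {
        if (audio == null || audio.length == 0) {
            throw new IllegalArgumentException("audio file is empty");
        }
        return client.transcribe(audio, filename).trim();
    }
}

class ArchitectureDemo {
    public static void main(String[] args) {
        // Stub client: echoes the file name and size instead of calling an API.
        SpeechToTextClient stub =
                (audio, name) -> " transcript of " + name + " (" + audio.length + " bytes) ";
        TranscriptionService service = new TranscriptionService(stub);
        System.out.println(service.handleUpload(new byte[] {1, 2, 3}, "hello.wav"));
    }
}
```

Keeping the AI call behind a small interface is one way to switch between providers, or to test the upload path with a stub, without touching the rest of the backend.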

🛠️ Tech Stack

Frontend

React
Axios
CSS
Vite

Backend

Spring Boot
Java
REST API
Maven

AI Integration

Speech-to-Text API (Whisper / Groq / OpenAI)

📂 Project Structure

Directory structure:
└── ai-audio-transcriber/
    ├── README.md
    ├── Audio-Transcriber/
    │   ├── mvnw
    │   ├── mvnw.cmd
    │   ├── pom.xml
    │   ├── src/
    │   │   ├── main/
    │   │   │   ├── java/
    │   │   │   │   └── com/
    │   │   │   │       └── audioTranscriber/
    │   │   │   │           ├── Application.java
    │   │   │   │           ├── TranscriptionController.java
    │   │   │   │           └── WebConfig.java
    │   │   │   └── resources/
    │   │   │       └── application.properties
    │   │   └── test/
    │   │       └── java/
    │   │           └── com/
    │   │               └── audioTranscriber/
    │   │                   └── ApplicationTests.java
    │   └── .mvn/
    │       └── wrapper/
    │           └── maven-wrapper.properties
    └── audio-transcriber-frontend/
        ├── README.md
        ├── eslint.config.js
        ├── index.html
        ├── package.json
        ├── vite.config.js
        └── src/
            ├── App.css
            ├── App.jsx
            ├── AudioUploader.jsx
            ├── index.css
            └── main.jsx

⚙️ Installation & Setup

1️⃣ Clone the Repository

git clone https://github.com/himanshux19/ai-audio-transcriber.git
cd ai-audio-transcriber

2️⃣ Backend Setup (Spring Boot)

Navigate to the backend folder:

cd Audio-Transcriber

Run the backend server (on Windows, use mvnw.cmd instead of ./mvnw):

./mvnw spring-boot:run

Backend runs on:

http://localhost:8080
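
Once the backend is running, an upload can be tried without the React client. The sketch below builds a multipart/form-data request using only the JDK (Java 11 or later). The endpoint path `/api/transcribe` and the form field name `file` are assumptions for illustration; check TranscriptionController.java for the actual mapping.

```java
import java.io.ByteArrayOutputStream;
import java.net.URI;
import java.net.http.HttpRequest;
import java.nio.charset.StandardCharsets;

final class MultipartUpload {
    // Builds a multipart/form-data body containing a single file part.
    static byte[] buildBody(String boundary, String fieldName, String filename, byte[] content) {
        String header = "--" + boundary + "\r\n"
                + "Content-Disposition: form-data; name=\"" + fieldName
                + "\"; filename=\"" + filename + "\"\r\n"
                + "Content-Type: application/octet-stream\r\n\r\n";
        String footer = "\r\n--" + boundary + "--\r\n";
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        out.writeBytes(header.getBytes(StandardCharsets.UTF_8));
        out.writeBytes(content);
        out.writeBytes(footer.getBytes(StandardCharsets.UTF_8));
        return out.toByteArray();
    }

    // Wraps the body in a POST request with the matching Content-Type header.
    static HttpRequest buildRequest(String url, String fieldName, String filename, byte[] content) {
        String boundary = "----AudioUpload" + System.nanoTime();
        return HttpRequest.newBuilder(URI.create(url))
                .header("Content-Type", "multipart/form-data; boundary=" + boundary)
                .POST(HttpRequest.BodyPublishers.ofByteArray(
                        buildBody(boundary, fieldName, filename, content)))
                .build();
    }
}

class UploadDemo {
    public static void main(String[] args) {
        // Hypothetical endpoint and field name; nothing is sent here.
        HttpRequest request = MultipartUpload.buildRequest(
                "http://localhost:8080/api/transcribe", "file", "sample.wav", new byte[] {0, 1, 2});
        System.out.println(request.method() + " " + request.uri());
    }
}
```

To actually send the request, pass it to `HttpClient.newHttpClient().send(request, HttpResponse.BodyHandlers.ofString())` and read the transcription from the response body.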

3️⃣ Frontend Setup (React)

From the repository root, navigate to the frontend folder:

cd audio-transcriber-frontend

Install dependencies:

npm install

Run the frontend:

npm run dev

Frontend runs on:

http://localhost:5173

🔐 Environment Variables

API keys are not included in this repository for security reasons.

Create a .env file and add the key for the provider you use:

OPENAI_API_KEY=your_api_key_here

or

GROQ_API_KEY=your_api_key_here
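
How the backend reads these values is not shown here, and Spring Boot does not load a .env file by default. As a minimal sketch, assuming the variables are exported into the process environment, a helper could look the key up like this. `ApiKeys` is a hypothetical class, and checking `OPENAI_API_KEY` before `GROQ_API_KEY` is an arbitrary choice for illustration.

```java
import java.util.Map;
import java.util.Optional;

final class ApiKeys {
    // Returns the first non-blank key found, checking OPENAI_API_KEY before GROQ_API_KEY.
    // The lookup order is an assumption, not the project's documented behavior.
    static Optional<String> resolve(Map<String, String> env) {
        for (String name : new String[] {"OPENAI_API_KEY", "GROQ_API_KEY"}) {
            String value = env.get(name);
            if (value != null && !value.isBlank()) {
                return Optional.of(value.trim());
            }
        }
        return Optional.empty();
    }
}

class ApiKeyDemo {
    public static void main(String[] args) {
        // Reports only whether a key is present; the key itself is never printed.
        String status = ApiKeys.resolve(System.getenv()).isPresent()
                ? "API key found"
                : "no API key set";
        System.out.println(status);
    }
}
```

Taking the environment as a `Map` parameter keeps the lookup testable without changing real environment variables.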

📸 Demo

Upload an audio file → AI processes the speech → the text transcription appears once processing completes.

💡 Future Improvements

Real-time microphone transcription
Download transcript feature
Copy-to-clipboard button
Multi-language support
Drag-and-drop audio upload

👨‍💻 Author

Himanshu Singh
