Skip to content

Himanshux19/AI-Audio-Transcriber

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🎧 AI Audio Transcriber

AI-powered audio-to-text transcription web application that converts speech into text using a React frontend and Spring Boot backend. Users can upload audio files and receive accurate text transcriptions powered by AI speech recognition APIs.

🚀 Features

  • Upload audio files for transcription
  • Convert speech to text using AI
  • Modern responsive UI
  • Full-stack architecture (React + Spring Boot)
  • Loading indicator while processing audio
  • REST API integration
  • Clean and responsive user interface

🏗️ Project Architecture

Client (React)
      │
      ▼
Spring Boot REST API
      │
      ▼
AI Speech-to-Text API
      │
      ▼
Transcription Result

🛠️ Tech Stack

Frontend

  • React
  • Axios
  • CSS
  • Vite

Backend

  • Spring Boot
  • Java
  • REST API
  • Maven

AI Integration

  • Speech-to-Text API (Whisper / Groq / OpenAI)

📂 Project Structure

Directory structure:
└── ai-audio-transcriber/
    ├── README.md
    ├── Audio-Transcriber/
    │   ├── mvnw
    │   ├── mvnw.cmd
    │   ├── pom.xml
    │   ├── src/
    │   │   ├── main/
    │   │   │   ├── java/
    │   │   │   │   └── com/
    │   │   │   │       └── audioTranscriber/
    │   │   │   │           ├── Application.java
    │   │   │   │           ├── TranscriptionController.java
    │   │   │   │           └── WebConfig.java
    │   │   │   └── resources/
    │   │   │       └── application.properties
    │   │   └── test/
    │   │       └── java/
    │   │           └── com/
    │   │               └── audioTranscriber/
    │   │                   └── ApplicationTests.java
    │   └── .mvn/
    │       └── wrapper/
    │           └── maven-wrapper.properties
    └── audio-transcriber-frontend/
        ├── README.md
        ├── eslint.config.js
        ├── index.html
        ├── package.json
        ├── vite.config.js
        └── src/
            ├── App.css
            ├── App.jsx
            ├── AudioUploader.jsx
            ├── index.css
            └── main.jsx

⚙️ Installation & Setup

1️⃣ Clone the Repository

git clone https://github.com/himanshux19/ai-audio-transcriber.git
cd ai-audio-transcriber

2️⃣ Backend Setup (Spring Boot)

Navigate to backend folder:

cd Audio-Transcriber

Run the backend server:

./mvnw spring-boot:run

Backend runs on:

http://localhost:8080

3️⃣ Frontend Setup (React)

Navigate to frontend folder:

cd audio-transcriber-frontend

Install dependencies:

npm install

Run the frontend:

npm run dev

Frontend runs on:

http://localhost:5173

🔐 Environment Variables

API keys are not included in this repository for security.

Create a .env file and add:

OPENAI_API_KEY=your_api_key_here

or

GROQ_API_KEY=your_api_key_here

📸 Demo

Upload an audio file → AI processes the speech → Text transcription appears instantly.

image

💡 Future Improvements

  • Real-time microphone transcription
  • Download transcript feature
  • Copy-to-clipboard button
  • Multi-language support
  • Drag-and-drop audio upload

👨‍💻 Author

Himanshu Singh

GitHub

About

AI-powered audio transcription web app that converts speech to text using a React frontend and Spring Boot backend.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors