ASR-transformersNN

ASR system using transformers neural networks from scratch

This is a project that was with the intention for DOT.

Our own ASR with trasnformers Neural Networks for DOT, the problem was the Dataset, it contains only 500 hours for training "Librispeech clean 360 and 100", you can see the model in kaggle.

https://www.kaggle.com/code/bernardoolisan/new-asr-dot?scriptVersionId=93010436

To make a model like Google's ASR, you will need more than 10000 hours for training, there's a lib called GigaSpeech that contains more than 10000 hours, but is more than 1Tb, you will need to have a better pc, at least 1 epoch is going to take 12 or more hours.

https://github.com/SpeechColab/GigaSpeech

But the model at the end works perfect:)

the model.model only contains 100 epochs of training with a loss aprox of 0.65

Tutorial

The tutorial is on Medium.com here https://medium.com/@bernardoolisan/asr-speech-recognition-creation-with-tranformers-nn-30a8b8af1b6e Go Clap that post!

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
Speechrecognition		Speechrecognition
speechrecognitionE.model		speechrecognitionE.model
ASR.py		ASR.py
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ASR-transformersNN

Tutorial

Try to use a good mic...

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ASR-transformersNN

Tutorial

Try to use a good mic...

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages