Automated-Speech-Recognition-System-for-Spoken-digits-Using-Deep-Learning

Keywords: Deep Learning, Convolutional Neural Network, Regularisation, Convolutional Neural Network with Residual Connection

Designed, implemented and evaluated an Automated Speech Recognition (ASR) system for spoken digits using deep learning methods. Spectrograms were extracted from the raw .wav speech files using digital signal processing in Python. Three Convolutional Neural Network (CNN) models were explored: the first, with 4 CNN blocks and 512 filters, reached 69.68 percent accuracy; the second, with 4 CNN blocks and 512 filters plus regularisation, reached 72.42 percent; and the third, a CNN with residual connections, performed best at 74.64 percent. Training and validation performance was visualised during hyperparameter optimisation, and performance on each task is shown with a confusion plot.
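The spectrogram extraction step can be illustrated with a minimal sketch. This is not the code from ASR.ipynb: it assumes 16-bit mono PCM input and a Hann-windowed short-time Fourier transform computed with NumPy. The function names (`read_wav_mono`, `log_spectrogram`, `make_test_tone`) and the frame parameters (`frame_len=256`, `hop=128`) are hypothetical choices for illustration. A synthetic tone stands in for a real recording so the example runs on its own.

```python
import io
import wave

import numpy as np


def read_wav_mono(source):
    """Read a 16-bit mono PCM .wav; return (samples scaled to [-1, 1), sample_rate)."""
    with wave.open(source, "rb") as wf:
        if wf.getsampwidth() != 2 or wf.getnchannels() != 1:
            raise ValueError("this sketch only handles 16-bit mono PCM")
        rate = wf.getframerate()
        raw = wf.readframes(wf.getnframes())
    samples = np.frombuffer(raw, dtype="<i2").astype(np.float64) / 32768.0
    return samples, rate


def log_spectrogram(samples, frame_len=256, hop=128, eps=1e-10):
    """Return a (num_frames, frame_len // 2 + 1) log-magnitude spectrogram in dB."""
    if len(samples) < frame_len:
        # Zero-pad very short clips so at least one frame exists.
        samples = np.pad(samples, (0, frame_len - len(samples)))
    num_frames = 1 + (len(samples) - frame_len) // hop
    window = np.hanning(frame_len)
    frames = np.stack(
        [samples[i * hop: i * hop + frame_len] * window for i in range(num_frames)]
    )
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    return 20.0 * np.log10(magnitude + eps)


def make_test_tone(freq_hz=1000.0, rate=8000, seconds=0.5):
    """Build an in-memory .wav holding a sine tone (assumed stand-in for speech)."""
    t = np.arange(int(rate * seconds)) / rate
    pcm = (0.5 * np.sin(2 * np.pi * freq_hz * t) * 32767).astype("<i2")
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wf:
        wf.setnchannels(1)
        wf.setsampwidth(2)
        wf.setframerate(rate)
        wf.writeframes(pcm.tobytes())
    buf.seek(0)
    return buf


samples, rate = read_wav_mono(make_test_tone())
spec = log_spectrogram(samples)

# Strongest frequency bin, averaged over time, converted back to Hz.
peak_bin = int(spec.mean(axis=0).argmax())
peak_hz = peak_bin * rate / 256
print(spec.shape, peak_hz)
```

To use a real recording, pass a file path such as `read_wav_mono("some_digit.wav")` (a hypothetical file name) in place of the synthetic tone. The resulting 2-D array is the kind of image-like input a CNN can consume.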

.gitignore
ACS61011_Deep_Learning_Assignment_Individual_Project.pdf
ASR.ipynb
LICENSE
README.md
doing_the_dishes.wav

University-Assignments-Thesis/Deep-Learning-Assignment-Automated-Speech-Recognition-System-for-Spoken-digits
