Please use this identifier to cite or link to this item: https://elibrary.khec.edu.np:8080/handle/123456789/419
Title: Nepali AI Sallakar
Authors: Krishnadev Adhikari Danuwar (740318)
Kushal Badal (740320)
Simanta Karki (740341)
Sirish Titaju (740342)
Swostika Shrestha (740348)
Er.Dinesh Gothe
Advisor: Er.DineshGothe
Keywords: Mel Frequency Cepstral Coefficient (MFCC), Tacotron, Fast Fourier Transform, Recurrent Neural Network(RNN), Long Short Term Memory(LSTM), Gaussian Mixture Model.
Issue Date: Aug-2022
College Name: Khwopa Engineering College
Level: BE
Department Name: Department of Computer
Abstract: This project ”Nepali AI Sallakar” is about how Artificial Intelligence (AI) can be used to classify the different age group using speech signal. Age estimation based on human’s speech features is an interesting subject in Automatic Speech Recognition (ASR) systems. In age estimation, like other speech processing systems, we encounter with two main challenges: finding an appropriate procedure for feature extraction, and selecting a reliable method for pattern classification. In this project we have used Mel Frequency Cepstral Coefficient (MFCC) for feature extraction. And we have used a special type of Recurrent Neural Network(RNN) called as Long Short Term Memory(LSTM).We have defined 6 age groups of speakers namely teens, twenties, thirties up to sixties. Furthermore, we used this model in recommendation system. For this we take input as audio of user and according to the gender and age group predicted from the model we recommend different things to the user.
URI: https://elibrary.khec.edu.np/handle/123456789/419
Appears in Collections:Computer Report

Files in This Item:
File Description SizeFormat 
Nepali AI Sallakar.pdf
  Restricted Access
19.96 MBAdobe PDFThumbnail
View/Open Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.