Please use this identifier to cite or link to this item:
https://elibrary.khec.edu.np:8080/handle/123456789/419
Title: | Nepali AI Sallakar |
Authors: | Krishnadev Adhikari Danuwar (740318) Kushal Badal (740320) Simanta Karki (740341) Sirish Titaju (740342) Swostika Shrestha (740348) Er.Dinesh Gothe |
Advisor: | Er.DineshGothe |
Keywords: | Mel Frequency Cepstral Coefficient (MFCC), Tacotron, Fast Fourier Transform, Recurrent Neural Network(RNN), Long Short Term Memory(LSTM), Gaussian Mixture Model. |
Issue Date: | Aug-2022 |
College Name: | Khwopa Engineering College |
Level: | BE |
Department Name: | Department of Computer |
Abstract: | This project ”Nepali AI Sallakar” is about how Artificial Intelligence (AI) can be used to classify the different age group using speech signal. Age estimation based on human’s speech features is an interesting subject in Automatic Speech Recognition (ASR) systems. In age estimation, like other speech processing systems, we encounter with two main challenges: finding an appropriate procedure for feature extraction, and selecting a reliable method for pattern classification. In this project we have used Mel Frequency Cepstral Coefficient (MFCC) for feature extraction. And we have used a special type of Recurrent Neural Network(RNN) called as Long Short Term Memory(LSTM).We have defined 6 age groups of speakers namely teens, twenties, thirties up to sixties. Furthermore, we used this model in recommendation system. For this we take input as audio of user and according to the gender and age group predicted from the model we recommend different things to the user. |
URI: | https://elibrary.khec.edu.np/handle/123456789/419 |
Appears in Collections: | Computer Report |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Nepali AI Sallakar.pdf Restricted Access | 19.96 MB | Adobe PDF | View/Open Request a copy |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.