Browsing by Author "Patil, Hemant A."

Acoustic analysis of musical pillars of vitthala temple, Hampi

Lakshmipriya, V. K. (Dhirubhai Ambani Institute of Information and Communication Technology, 2014)

This thesis is a systematic investigation on the acoustics of musical pillars of Vitthala temple at Hampi, India. The columns of different pillars produce sounds of different musical instruments (in particular, instruments ...

Acoustic-to-articulatory inversion: speech quality assessment and smoothness constraint

Rajpal, Avni (Dhirubhai Ambani Institute of Information and Communication Technology, 2015)

The ability of humans to speak effortlessly, require coordinated movements of various articulators, muscles, etc. This effortless movement contributes towards naturalness, intelligibility and speaker identity in human ...

Analysis of nonlinearity in speech production mechanism for speaker verification: phase-based approach

Agrawal, Purvi (Dhirubhai Ambani Institute of Information and Communication Technology, 2015)

Many of the real-world signal processing problems can be described using linear models, and can be realized as analog or digital filter, time-invariant filters; finite or infinite impulse response (IIR or FIR) filters. In ...

Analysis of voice biometric attacks: detection of synthetic vs natural speech

S, Adarsa (Dhirubhai Ambani Institute of Information and Communication Technology, 2014)

The improvement in text-to-speech (TTS) synthesis also poses the problem of biometric attack on speaker verification system. In this context, it is required to analyse the performance of these system for false acceptance ...

Auditory representation learning

Sailor, Hardik B. (Dhirubhai Ambani Institute of Information and Communication Technology, 2018)

Representation learning (RL) or feature learning has a huge impact in the field of signal processing applications. The goal of the RL approaches is to learn the meaningful representation directly from the data that can be ...

Automatic speech recognition using deep neural networks

Sharma, Manisha (Dhirubhai Ambani Institute of Information and Communication Technology, 2016)

Automatic Speech Recognition (ASR) is an important field of research because ofits widespread use in various fields such as military, health services, day-to-dayactivities, etc. ASR task was earlier done using GMM-HMM ...

Classification of Pathological Infant Cries and Dysarthric Severity-Level

Kachhi, Aastha Bidhenbhai (Dhirubhai Ambani Institute of Information and Communication Technology, 2022)

Vocal communication is the most important part of any individual�s life to convey their needs. Right from the first cry of neonates to the matured adult speech, required proper brain co-ordination. Any kind of lack in ...

Crying for a reason : a signal processing based approach for infant cry analysis and classification

Chittora, Anshu (Dhirubhai Ambani Institute of Information and Communication Technology, 2016)

The present work in this thesis is directed towards understanding the reason of crying of an infant using signal processing approaches. Infant cry analysis and classification is a non-invasive method of analyzing the infant ...

Data Augmentation Using CycleGAN for Children’s ASR43e

Singh, Dipesh Kumar (2021)

Extensive use of voice assistants by children in their day-to-day life activities asks for better performance of Automatic Speech Recognition (ASR) for children’speech. The recent advancements in ASR perform better for ...

Deep Learning for Severity Level-based Classification of Dysarthria

Gupta, Siddhant (2021)

Dysarthria is a motor speech disorder in which muscles required to speak somehow gets damaged or paralyzed resulting in an adverse effect to the articulatory elements in the speech and rendering the output voice unintelligible. ...

Deep learning techniques for speech pathology applications

Purohit, Mirali Virendrabhai (2020)

Human-machine interaction has gained more attention due to its interesting applications in industries and day-to-day life. In recent years, speech technologies have grown rapidly because of the advancement in fields of ...

Design of countermeasures for replay spoof speech attack

Tak, Hemlata (Dhirubhai Ambani Institute of Information and Communication Technology, 2018)

Automatic Speaker Verification (ASV) system is a biometric person authentication system to verify a claimed speaker's identity from his/her voice with the help of machines. The ASV systems are vulnerable to various types ...

Design of countermeasures for spoofed speech detection system

Patel, Tanvina (Dhirubhai Ambani Institute of Information and Communication Technology, 2017)

Automatic Speaker Verification (ASV) systems are vulnerable to speech synthesisand voice conversion techniques due to spoofing attacks.Recently, to encourage thedevelopment of anti-spoofing measures or countermeasures for ...

Design of QbE-STD System: audio representation and matching perspective

Madhavi, Maulik C. (Dhirubhai Ambani Institute of Information and Communication Technology, 2017)

The retrieval of the spoken document and detecting the query (keyword) within the audio document have attained huge research interest. The problem of retrieving audio documents and detecting the query (keyword) using a ...

Design of robust automatic speaker verification system in adverse conditions

Rajpura, Divyesh G. (2020)

The Automatic Speaker Verification (ASV) aims to verify the identity of a person from his/her voice with the help of machines. It has become an essential component of many speech-related applications due to its use as a ...

Design of spoof speech detection system : teager energy-based approach

Kamble, Madhu R. (Dhirubhai Ambani Institute of Information and Communication Technology, 2021)

Automatic Speaker Verification (ASV) systems are vulnerable to various spoofing attacks, namely, Speech Synthesis (SS), Voice Conversion (VC), Replay, and Impersonation. The study of spoofing countermeasures has become ...

Design of syllable-based speech segmentation methods for text-to-speech (TTS) synthesis system for Gujarati

Talesara, Swati (Dhirubhai Ambani Institute of Information and Communication Technology, 2013)

Text-to-speech (TTS) synthesizer has been proved to be an aiding tool for many visually challenged people for reading through hearing feedback. Although there are TTS synthesizers available in English and other languages ...

Design of Voice Privacy System

Prajapati, Gauri P. (2021)

Extensive use of Intelligent Personal Assistants (IPA) and biometrics in our day to day life asks for privacy preservation while dealing with personal data. To that effect, efforts have been made to preserve the personally ...

Development of Countermeasures for Voice Liveness and Spoofed Speech Detection

Chodingala, Piyushkumar Kiritbhai (Dhirubhai Ambani Institute of Information and Communication Technology, 2022)

An Automatic Speaker Verification (ASV) or voice biometric system performs machine based authentication of speakers using voice signals. ASV is a voice biometric system which has applications, such as banking transactions ...

Environmental Sound Classification (ESC) using Handcrafted and Learned Features

Agrawal, Dharmeshkumar Maheshchandra (Dhirubhai Ambani Institute of Information and Communication Technology, 2017)

"Environmental Sound Classification (ESC) is an important research field due to its application in various field such as hearing aids, road surveillance system for security and safety purpose, etc. ESC task was earlier ...