Single channel or multi-channel audio control interface LH Kim, E Visser, R Peri, PL Ton, JP Toman, T Schultz, J Zheng US Patent 10,051,364, 2018 | 107 | 2018 |
Adversarial attack and defense strategies for deep speaker recognition systems A Jati, CC Hsu, M Pal, R Peri, W AbdAlmageed, S Narayanan Computer Speech & Language 68, 101199, 2021 | 76 | 2021 |
Automated evaluation of psychotherapy skills using speech and language technologies N Flemotomos, VR Martinez, Z Chen, K Singla, V Ardulov, R Peri, ... Behavior Research Methods 54 (2), 690-711, 2022 | 73 | 2022 |
Deep neural net based filter prediction for audio event classification and extraction E Visser, Y Guo, LH Kim, R Peri, S Zhang US Patent 9,666,183, 2017 | 56 | 2017 |
Virtual, augmented, and mixed reality E Visser, LH Kim, R Peri US Patent App. 15/238,591, 2018 | 49 | 2018 |
Robust speaker recognition using unsupervised adversarial invariance R Peri, M Pal, A Jati, K Somandepalli, S Narayanan ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 35 | 2020 |
User-level differential privacy against attribute inference attack of speech emotion recognition in federated learning T Feng, R Peri, S Narayanan arXiv preprint arXiv:2204.02500, 2022 | 30 | 2022 |
Adversarial defense for deep speaker recognition using hybrid adversarial training M Pal, A Jati, R Peri, CC Hsu, W AbdAlmageed, S Narayanan ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 27 | 2021 |
Speaker diarization using latent space clustering in generative adversarial network M Pal, M Kumar, R Peri, TJ Park, SH Kim, C Lord, S Bishop, S Narayanan ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 26 | 2020 |
Meta-learning with latent space clustering in generative adversarial network for speaker diarization M Pal, M Kumar, R Peri, TJ Park, SH Kim, C Lord, S Bishop, S Narayanan IEEE/ACM transactions on audio, speech, and language processing 29, 1204-1219, 2021 | 24 | 2021 |
Drone flight control E Visser, LH Kim, RDJB Castillo, S Zhang, R Peri US Patent 10,379,534, 2019 | 20 | 2019 |
Method, system and article of manufacture for processing spatial audio LH Kim, R Peri, E Visser US Patent 9,578,439, 2017 | 18 | 2017 |
Disentanglement for audio-visual emotion recognition using multitask setup R Peri, S Parthasarathy, C Bradshaw, S Sundaram ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 15 | 2021 |
Multi-Task Discriminative Training of Hybrid DNN-TVM Model for Speaker Verification with Noisy and Far-Field Speech. A Jati, R Peri, M Pal, TJ Park, N Kumar, R Travadi, PG Georgiou, ... Interspeech, 2463-2467, 2019 | 15 | 2019 |
Collaborative audio processing LH Kim, E Visser, R Peri US Patent 9,706,300, 2017 | 15 | 2017 |
An empirical analysis of information encoded in disentangled neural speaker representations R Peri, H Li, K Somandepalli, A Jati, S Narayanan arXiv preprint arXiv:2002.03520, 2020 | 13 | 2020 |
Cloud-based processing using local device provided sensor data and labels E Visser, M Jin, LH Kim, R Peri, S Zhang US Patent App. 15/273,496, 2017 | 13 | 2017 |
The Second DIHARD Challenge: System Description for USC-SAIL Team. TJ Park, M Kumar, N Flemotomos, M Pal, R Peri, R Lahiri, PG Georgiou, ... INTERSPEECH, 998-1002, 2019 | 11 | 2019 |
A study of bias mitigation strategies for speaker recognition R Peri, K Somandepalli, S Narayanan Computer Speech & Language 79, 101481, 2023 | 10 | 2023 |
Single-channel or multi-channel audio control interface LH Kim, E Visser, R Peri, PL Ton, JP Toman, T Schultz, J Zheng US Patent 10,073,607, 2018 | 9 | 2018 |