Here is a list of my talks and presentations (including presenting work by other authors in reading groups):

  • Listening to Multi-talker Conversations: Modular and End-to-end Perspectives Slides Video
    PhD Thesis Defense
    January 26, 2024

  • VoiceBox: Text-guided multi-lingual speech generation at scale Slides
    Speech Technologies Reading Group
    September 22, 2023

  • Listening to Multi-talker Conversations: Modular and End-to-end Perspectives Slides
    Invited talk at NVIDIA Speech group
    August 18, 2023

  • FLASH Attention Slides
    Speech Technologies Reading Group
    April 14, 2023

  • Target Speaker Methods for Speech Recognition Slides
    CLSP Seminar
    March 27, 2023

  • Training RNN-T models without memory bottleneck Slides
    Speech Technologies Reading Group
    October 14, 2022

  • GBO presentation Slides
    Malone 228 (May 04, 2022)

  • Overlap-aware Speaker Diarization: Methods and Ensembles
    ISCA SIG-ML Seminar (May 05, 2021): Video Slides
    CLSP Seminar (January 29, 2021): Slides

  • TS-ASR: Speaker Beam and Voice Filter Slides
    Speech Technologies Reading Group
    October 02, 2020

  • Informed Target Speaker ASR Slides
    JSALT 2020 Closing Presentation
    August 06, 2020

  • Target Speaker - Voice Activity Detection Paper Slides
    Speech Technologies Reading Group
    May 29, 2020

  • The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge Video Slides
    CHiME-6 Virtual Workshop
    May 04, 2020

  • CLSP Seminar Lightning Talk Slides
    CLSP Seminar
    April 03, 2020

  • Imputer: Sequence Modeling via Imputation and Dynamic Programming Paper Slides
    Speech Technologies Reading Group
    Barton 225, 3101 Wyman Park Dr, Baltimore
    March 06, 2020

  • Transformer ASR with Contextual Block Processing Paper Slides
    Speech Technologies Reading Group
    Hackerman 320, 3101 Wyman Park Dr, Baltimore
    November 04, 2019

  • Joint CTC-Attention for ASR using Multi-task Learning Paper Slides
    Information Extraction Lightning Talk
    Hackerman 320, 3101 Wyman Park Dr, Baltimore
    May 02, 2019

  • Contrastive Predictive Coding Paper Slides
    Speech Technologies Reading Group
    Barton 225, 3101 Wyman Park Dr, Baltimore
    April 29, 2019

  • Dataset Shift in NLP Paper Slides
    NLP Reading Group
    Hackerman 306, 3101 Wyman Park Dr, Baltimore
    April 17, 2019

  • Attention-based Models for ASR Paper Slides
    Speech Technologies Reading Group
    Barton 225, 3101 Wyman Park Dr, Baltimore
    March 11, 2019