Publications

2025

  1. Preprint
    POWSM: A Phonetic Open Whisper-Style Speech Foundation Model
    Chin-Jou Li, Kalvin Chang, Shikhar Bharadwaj, Eunjung Yeo, Kwanghee Choi, and 3 more authors
    2025
  2. Interspeech
    Towards Inclusive ASR: Investigating Voice Conversion for Dysarthric Speech Recognition in Low-Resource Languages
    Chin-Jou Li, Eunjung Yeo, Kwanghee Choi, Paula Andrea Pérez-Toro, Masao Someki, and 5 more authors
    In Interspeech, 2025
  3. Preprint
    Prompt-MII: Meta-Learning Instruction Induction for LLMs
    Emily Xiao, Yixiao Zeng, Ada Chen, Chin-Jou Li, Amanda Bertsch, and 1 more author
    2025
  4. ACL
    Efficient Many-Shot In-Context Learning with Dynamic Block-Sparse Attention
    Emily Xiao, Chin-Jou Li, Yilin Zhang, Graham Neubig, and Amanda Bertsch
    In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, 2025

2024

  1. EMBC
    Epileptic Seizure Classification with Patient-level and Video-level Contrastive Pretraining
    Chin-Jou Li, Chien-Chen Chou, Yen-Cheng Shih, Li-Chuan Kuo, Yu-Te Wang, and 4 more authors
    In 2024 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2024
  2. Interspeech Satellite
    Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement
    Shafique Ahmed, Chia-Wei Chen, WenZe Ren, Chin-Jou Li, Ernie Chu, and 5 more authors
    In 3rd COG-MHEAR Workshop on Audio-Visual Speech Enhancement (AVSEC), 2024
  3. Face swapping in seizure videos for patient deidentification
    Chin-Jou Li, Jen-Cheng Hou, Chien-Chen Chou, Yen-Cheng Shih, Stephane Dufau, and 3 more authors
    2024

2023

  1. Artificial Intelligence-Based Face Transformation in Patient Seizure Videos for Privacy Protection
    Jen-Cheng Hou, Chin-Jou Li, Chien-Chen Chou, Yen-Cheng Shih, Si-Lei Fong, and 5 more authors
    2023