publications

Publications by category in reverse chronological order. Generated by jekyll-scholar.

2025

  1. PCC
    Self-Supervised Cross-View Correspondence with Predictive Cycle Consistency
    Alan Baade and Changan Chen
    In Submission to CVPR 2025, 2025
  2. SyllableLM
    SyllableLM: Learning Coarse Semantic Units for Speech Language Models
    Alan Baade, Puyuan Peng, and David Harwath
    In Submission to ICLR 2025. OpenReview, 2025

2024

  1. Disentangled NCLM
    Neural Codec Language Models for Disentangled and Textless Voice Conversion
    Alan Baade, Puyuan Peng, and David Harwath
    Interspeech, 2024

2022

  1. MAE-AST
    MAE-AST: Masked Autoencoding Audio Spectrogram Transformer
    Alan Baade, Puyuan Peng, and David Harwath
    Interspeech, 2022