Every day I post short descriptions (well titles, really) of recent speech articles
Assistant Prof of CS at the University of Waterloo, Faculty and Canada CIFAR AI Chair at the Vector Institute. Joining NYU Courant in September 2026. Co-EiC of TMLR. My group is The Salon. Privacy, robustness, machine learning.
http://www.gautamkamath.com
Cognitive scientist, linguist, phonetician at the University of Zurich Dept. of Computational Linguistics
Professor, Programmer in NYC.
Cornell, Hugging Face 🤗
Research Director, Founding Faculty, Canada CIFAR AI Chair @VectorInst.
Full Prof @UofT - Statistics and Computer Sci. (x-appt) danroy.org
I study assumption-free prediction and decision making under uncertainty, with inference emerging from optimality.
Reverse engineering neural networks at Anthropic. Previously Distill, OpenAI, Google Brain.Personal account.
Cofounded and lead PyTorch at Meta. Also dabble in robotics at NYU.
AI is delicious when it is accessible and open-source.
http://soumith.ch
https://Answer.AI & https://fast.ai founding CEO; previous: hon professor @ UQ; leader of masks4all; founding CEO Enlitic; founding president Kaggle; various other stuff…
Once was speech technologist - Water of Leith, Edinburgh - Born 320.23 ppm
I created pyannote open source toolkit.
Co-founder and CSO at pyannoteAI
Full professor of inclusive speech communication at TU Delft, The Netherlands. Former president of the International Speech Communication Association (ISCA). Mother of 3🌈
Speech and audio research scientist @MERL. saneworkshop.org co-founder. IguanaTex developer.
🌐 jonathanleroux.org
🐙 github.com/Jonathan-LeRoux/
🎓 scholar.google.com/citations?user=aUpxty8AAAAJ&hl=en
PhD Student @CMU, Speech AI Research
https://pyf98.github.io/
phd @ berkeley. #speechproc.
@ltiatcmu.bsky.social '23 + @scsatcmu.bsky.social '22 + ex-Amazon. eng、華語、tâi-gú
AI + Speech @ Nvidia. PhD @ AGH-UST, ex-JHU. My interests: speech processing technologies; ML/AI software engineering. Building OSS for Speech AI.
Master's student @ltiatcmu.bsky.social, working on speech AI at @shinjiw.bsky.social
PhD Student @ltiatcmu.bsky.social
I work in speech processing.
wanchichen.github.io
https://kyutai.org/ Open-Science AI Research Lab based in Paris
I build tools that propel communities forward
Working towards the safe development of AI for the benefit of all at Université de Montréal, LawZero and Mila.
A.M. Turing Award Recipient and most-cited AI researcher.
https://lawzero.org/en
https://yoshuabengio.org/profile/
Research Scientist at DeepMind. Opinions my own. Inventor of GANs. Lead author of http://www.deeplearningbook.org . Founding chairman of www.publichealthactionnetwork.org
Co-Founder & CEO, Sakana AI 🎏 → @sakanaai.bsky.social
https://sakana.ai/careers
Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...
Researcher@Meta Reality Labs, working on generative models, speech enhancement, speech recognition, TTS, etc.
https://nateanl.github.io/
Beatriz Galindo Research Fellow at UPF (Barcelona) | L1 & L2 Acquisition | Speech Perception | Spoken Word Recognition | Hobbies: ⚽️🏂🏈🍺☕️
Researcher in machine learning (speech recognition / private federated learning) in Cambridge
Research scientist at Duolingo (natural language processing and speech recognition). Mostly posts funny AI fails, sometimes also cool linguistics facts and rockets.
speech recognition @ naver
https://jaesong.github.io/
Research engineer, Speech and Language processing (mainly Speech Recognition). Views are my own.
https://emonosuke.github.io/
Shinji Watanabe's Audio and Voice Lab | WAVLab @LTIatCMU @SCSatCMU | Speech Recognition, Speech Enhancement, Spoken Language Understanding, and more.
Deep Learning, speech recognition, language modeling, https://scholar.google.com/citations?user=qrh5CBEAAAAJ&hl=en
Open source, https://github.com/albertz/
Assistant Professor of Communication @ Stanford. Histories of AI/ML, NLP, speech recognition, and related data practices; general tomfoolery.
Bioinformatics Scientist / Next Generation Sequencing, Single Cell and Spatial Biology, Next Generation Proteomics, Liquid Biopsy, SynBio, AI/ML in biotech // http://albertvilella.substack.com
Sr. Research Scientist at the Samsung AI Center in Cambridge. Affiliated lecturer at the University of Cambridge. SpeechBrain <3
Advocate for tech that makes humans better | Spatial Computing, Holodeck, and AI Futurist | Ex-Microsoft, Rackspace | Co-author, "The Infinite Retina."
Mobile app developer - Xamarin / MAUI - Swift & SwiftUI
AI @ OpenAI, Tesla, Stanford
interspeech2026.org
27 September – 1 October, ICC, Sydney, Australia
'Speaking Together'
Proudly hosted by the Australasian Speech Science and Technology Association (ASSTA) and the International Speech Communication Association (ISCA).
Assoc. Professor at UC Berkeley
Artificial and biological intelligence and language
Linguistics Lead at Project CETI 🐳
PI Berkeley Biological and Artificial Language Lab 🗣️
College Principal of Bowles Hall 🏰
https://www.gasperbegus.com
Research Scientist @ Meta GenAI in NYC.
Working on audio/speech for LLaMA.
Previously: PhD @ JHU CLSP
desh2608.github.io
Author of "Automate the Boring Stuff with Python" and other books. Mostly harmless. Last name rhymes with "why dirt." he/him
Music and artificial intelligence.
Researcher at Stability AI.
Musician at BRNRT Collective.
Previously at Dolby and Universitat Pompeu Fabra.
artintech.substack.com
www.jordipons.me
I'm working at CMU (2021-). I was working at NTT (2001-2011), MERL (2012-2017), and JHU (2017-2020). Speech and Audio Processing is my main research topic.