Professor, filmmaker, author. Most recently, "Bring Judgment Day: Reclaiming Lead Belly's Truths from Jim Crow's Lies" (Cambridge University Press). "[A] necessary act of historical justice" - Rock & Roll Highway
That bloke who does that podcast your annoying friend keeps trying to get you to listen to. He/him. Posts every random thought in his head. AuDHD and not many filters. If you're only interested in my podcast you probably don't want to follow.
music/audio/speech proc, generative models
PhD student (now), EECS MIT
MSc '24, SCS CMU
BSc '21, CS Nat'l Taiwan U
casual classical pianist🎹 & violist🎻
Road To Rouen 20th Anniversary Edition is out now! Listen on streaming, vinyl and 2CD bundles available to order now. http://supergrass.lnk.to/RoadToRouen2025IG
Research Scientist at Google DeepMind: Environmental sound understanding
https://eloweimi.github.io/
Ex-liontamer, writer/editor/strategist.
I used to work for Wired, The Verge, The Atlantic, The Message, Adweek, and Amazon Chronicles.
I also host sometimes at Kottke.org.
Call me he/him.
PHL.
Everything changes; don’t be afraid.
https://timcarmody.com
PhD Student @ Telecom Paris
Professor of Signal Processing
Head of Department of Informatics
@kingscollegelondon.bsky.social
Research in generative AI for **human** creativity in music + more.
Assistant professor at CMU CSD, leading the 🎼 G-CLef lab. Part time research scientist at Google DeepMind on the Magenta team (views my own)
I make Dad jokes on NPR and also write books and other things.
Researcher in Audio Signal Processing and Machine Learning ))))
Stanford Linguistics and Computer Science. Director, Stanford AI Lab. Founder of @stanfordnlp.bsky.social . #NLP https://nlp.stanford.edu/~manning/
universal musical approximator. research scientist at gorgle derpmind, magenta team. https://ethman.github.io
A free, collaborative, multilingual internet encyclopedia.
donate.wikipedia25.org
a mediocre combination of a mediocre AI scientist, a mediocre physicist, a mediocre chemist, a mediocre manager and a mediocre professor.
see more at https://kyunghyuncho.me/
Research scientist, Google DeepMind
My new special Night Thoughts is out NOW on Hulu.
linktr.ee/kumailnanjiani
Google Chief Scientist, Gemini Lead. Opinions stated here are my own, not those of Google. Gemini, TensorFlow, MapReduce, Bigtable, Spanner, ML things, ...
Science writer and author of books including Bright Earth, The Music Instinct, Beyond Weird, How Life Works.
News from Mitsubishi Electric Research Laboratories (MERL), Mitsubishi Electric Corporation's North American research organization.
🌐 merl.com
Senior Staff Research Scientist @Google DeepMind, former Chair Prof @Oxford Uni
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
WASPAA 2025 will be held Oct. 12-15, 2025 at Granlibakken Tahoe, Tahoe City, CA, USA.
Abstract deadline: April 23, 2025 (23:59 AOE)
Paper deadline: April 30, 2025 (23:59
Recently a principal scientist at Google DeepMind. Joining Anthropic. Most (in)famous for inventing diffusion models. AI + physics + neuroscience + dynamical systems.
https://kyutai.org/ Open-Science AI Research Lab based in Paris
Auditory signal processing researcher.
Researcher in statistics and machine learning for genomics
https://laurent-jacob.github.io/
Sr. Research Scientist at the Samsung AI Center in Cambridge. Affiliated lecturer at the University of Cambridge. SpeechBrain <3
Working towards the safe development of AI for the benefit of all at Université de Montréal, LawZero and Mila.
A.M. Turing Award Recipient and most-cited AI researcher.
https://lawzero.org/en
https://yoshuabengio.org/profile/
Postdoc researcher @telecomparis. Previously @CNRS/LS2N @c4dm. Machine learning for audio. https://changhongw.github.io/
Research Scientist @SonyAI
PhD from Seoul National University
Previous intern @MERL, @Sony, and @Supertone
Research Scientist @ Sound Scene Understanding Team, RIKEN-AIP, Japan
Now: Audio & Multimodal ML PhD in the Music and Audio Research Lab @ NYU
Prev: Data Developer at Sonos and Northwestern, Research Intern at Adobe + Bosch Research
PhD Student @ltiatcmu.bsky.social
I work in speech processing.
wanchichen.github.io
unsound lab cat @ georgia tech
California based auditory researcher and sports photographer.
Working to understand how humans and machines hear. Prof at MIT; director of Lab for Computational Audition. https://mcdermottlab.mit.edu/
Head of Audio and Video AI Research at Adobe Research
Associate professor at Télécom Paris in machine listening and audio applied to extended reality
Milanese-Californian Digital Speech and Audio Processing Technologist @ Apple
I’m a PhD student at the University of Illinois Urbana-Champaign working on audio inverse problems.
My website: https://xzwy.github.io/alanweiyang.github.io/
Senior Manager, Foundational Research, @GoogleDeepMind
Googler, Ex @Dolby & @Broadcom
Talks and Investments 👉🏽 http://portfolio.v1vek.com
Audio and AI researcher. Faculty in Siebel School at UIUC and Visiting Academic at Amazon Lab126. A working dad. Some obsolete hobbies: music, photography, drawing, and writing. Still active interests: cooking.
🏠 https://minjekim.com
official Bluesky account (check username👆)
Bugs, feature requests, feedback: support@bsky.app
Official account for the SANE series of workshops. The one-day events annually gather researchers and students in speech and audio from the Northeast of the American continent, alternately in Boston and NYC.
🌐 saneworkshop.org
I'm working at CMU (2021-). I was working at NTT (2001-2011), MERL (2012-2017), and JHU (2017-2020). Speech and Audio Processing is my main research topic.
Distinguished Scientist at Google. Computational Imaging, Machine Learning, and Vision. Posts are personal opinions. May change or disappear over time.
http://milanfar.org
Music machine learning, MIR, ML, DSP
AudioML research scientist at https://audioshake.ai, before: post-doc @inria@social.numerique.gouv.fr, Editor at https://bsky.app/profile/joss-openjournals.bsky.social
All in 17.68% of grey, located in Frankfurt (Germany)
I lead music ML research for Music. Flexitalian.
Researcher@Meta Reality Labs, working on generative models, speech enhancement, speech recognition, TTS, etc.
https://nateanl.github.io/
Farming chili peppers for fun and hot sauce 🌶️
Assistant Professor at University of Michigan | PhD from UC San Diego | Human-Centered Generative AI for Content Creation
human computer musical instruments
https://hugofloresgarcia.art/
phd candidate @northwestern
research intern @adobe
prev @spotify, @descript
chicago // honduras
Deep Learning, speech recognition, language modeling, https://scholar.google.com/citations?user=qrh5CBEAAAAJ&hl=en
Open source, https://github.com/albertz/
Music and artificial intelligence.
Researcher at Stability AI.
Musician at BRNRT Collective.
Previously at Dolby and Universitat Pompeu Fabra.
artintech.substack.com
www.jordipons.me
Challenge on Detection and Classification of Acoustic Scenes and Events.
https://dcase.community/
PhD Student @ Telecom Paris, ADASP team. Previously research scientist intern @ Deezer, Sony CSL (Music Team).
AI/ML for audio and music signal processing and synthesis.
Research Scientist at Google DeepMind working on audio/speech generation.
Tenured Assistant Professor at CentraleSupélec.
Signal processing and machine learning for speech and audio.
sleglaive.github.io
research in ML at MERL and Mila
francescopaissan.it
computers and music are (still) fun
Head of Sound Design AI Research at Adobe. Machine learning and signal processing for audio & video. Musician. He/him.
www.justinsalamon.com
Lecturer at the University of Edinburgh. Member of the Centre for Speech Technology Research (CSTR).
Does research on machine learning at Sony AI, Barcelona. Works on audio analysis, synthesis, and retrieval. Likes tennis, music, and wine.
https://serrjoa.github.io/
Researcher at Adobe Research. Machine learning on audio. Screamer. Oaklander born in Barcelona. Titan. He/they 🌈
www.urinieto.com
Postdoctoral researcher at Meta
Research scientist at #Inria. Audio signal processing, Acoustics, Machine Learning, Bicycle Riding, Lindy Hop Dancing.
Assistant professor at USherbrooke. Creator of the ODAS framework. Research in speech, multichannel audio processing, robot audition, embedded AI.
francoisgrondin.com
Assoc. Professor at UC Berkeley
Artificial and biological intelligence and language
Linguistics Lead at Project CETI 🐳
PI Berkeley Biological and Artificial Language Lab 🗣️
College Principal of Bowles Hall 🏰
https://www.gasperbegus.com
Speech • Language • Learning
https://grzegorz.chrupala.me
@ Tilburg University
Reader (Associate Professor), @qmuleecs.bsky.social Queen Mary University of London - research on AI for audio. Website: https://www.seresearch.qmul.ac.uk/cmai/people/ebenetos/
https://Answer.AI & https://fast.ai founding CEO; previous: hon professor @ UQ; leader of masks4all; founding CEO Enlitic; founding president Kaggle; various other stuff…
Research Scientist @ Meta GenAI in NYC.
Working on audio/speech for LLaMA.
Previously: PhD @ JHU CLSP
desh2608.github.io
AI researcher in music, audio, LLMs.
home of fine hypertext products since 1998. https://kottke.org
Communications at the Drugs for Neglected Diseases initiative (DNDi).
Former RFI, La Croix, Mediapart reporter in Seoul. Former aid worker in North Korea. fojardias@dndi.org
Professor at Université de Lorraine/Loria/Mines Nancy. Doing research in speech and audio processing.
That maths guy from the internet.