universal musical approximator. research scientist at gorgle derpmind, magenta team. https://ethman.github.io
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
WASPAA 2025 will be held Oct. 12-15, 2025 at Granlibakken Tahoe, Tahoe City, CA, USA.
Abstract deadline: April 23, 2025 (23:59 AOE)
Paper deadline: April 30, 2025 (23:59
https://kyutai.org/ Open-Science AI Research Lab based in Paris
Principal Scientist at Naver Labs Europe && Professor at University Grenoble Alpes
#NLP #AI #LLMs
Associate professor at CMU, studying natural language processing and machine learning. Co-founder All Hands AI
San Diego Dec 2-7, 25 and Mexico City Nov 30-Dec 5, 25. Comments to this account are not monitored. Please send feedback to townhall@neurips.cc.
Señor swesearcher @ Google DeepMind, adjunct prof at Université de Montréal and Mila. Musician. From 🇪🇨 living in 🇨🇦.
https://psc-g.github.io/
AI x storytelling
AI Engineering: https://amazon.com/dp/1098166302
Designing ML Systems: http://amazon.com/dp/1098107969
@chipro
https://Answer.AI & https://fast.ai founding CEO; previous: hon professor @ UQ; leader of masks4all; founding CEO Enlitic; founding president Kaggle; various other stuff…
Prof (CS @Stanford), Co-Director @StanfordHAI, Cofounder/CEO @theworldlabs, CoFounder @ai4allorg #AI #computervision #robotics #AI-healthcare
Co-CEO, Yutori. Try out Scouts at yutori.com.
Researcher (OpenAI. Ex: DeepMind, Brain, RWTH Aachen), Gamer, Hacker, Belgian.
Anon feedback: https://admonymous.co/giffmana
📍 Zürich, Suisse 🔗 http://lucasb.eyer.be
AI @ OpenAI, Tesla, Stanford
Research scientist at Anthropic. Prev. Google Brain/DeepMind, founding team OpenAI. Computer scientist; inventor of the VAE, Adam optimizer, and other methods. ML PhD. Website: dpkingma.com
PhD-ing at UMD. Knows a little about multimodal generative models. Check out my website to know more - https://somepago.github.io/
Associate Professor in EECS at MIT. Neural nets, generative models, representation learning, computer vision, robotics, cog sci, AI.
https://web.mit.edu/phillipi/
Blog: https://sander.ai/
🐦: https://x.com/sedielem
Research Scientist at Google DeepMind (WaveNet, Imagen 3, Veo, ...). I tweet about deep learning (research + software), music, generative models (personal account).
Research scientist at Google DeepMind working on music • DJ 🎶
https://ilariamanco.com/
Professor for CS at the Tuebingen AI Center and affiliated Professor at MIT-IBM Watson AI lab - Multimodal learning and video understanding - GC for ICCV 2025 - https://hildekuehne.github.io/
Associate Professor at UMD CS. YouTube: https://youtube.com/@jbhuang0604
Interested in how computers can learn and see.
PhD Student @CMU, Speech AI Research
https://pyf98.github.io/
The Language Technologies Institute in Carnegie Mellon University's @scsatcmu.bsky.social
lti.cmu.edu
AI + Speech @ Nvidia. PhD @ AGH-UST, ex-JHU. My interests: speech processing technologies; ML/AI software engineering. Building OSS for Speech AI.
Shinji Watanabe's Audio and Voice Lab | WAVLab @LTIatCMU @SCSatCMU | Speech Recognition, Speech Enhancement, Spoken Language Understanding, and more.
Automated posting of sound-related articles uploaded to arxiv.org (eess.AS + cs.SD)
Source: https://github.com/dsuedholt/bsky-paperbot-sound/
Inspired by @paperposterbot.bsky.social and https://twitter.com/ArxivSound
Speech and audio research scientist @MERL. saneworkshop.org co-founder. IguanaTex developer.
🌐 jonathanleroux.org
🐙 github.com/Jonathan-LeRoux/
🎓 scholar.google.com/citations?user=aUpxty8AAAAJ&hl=en
Professor/Admin @ Ohio State. All opinions expressed on this channel are my personal opinions and do not represent that of my employer.
Full professor of inclusive speech communication at TU Delft, The Netherlands. Former president of the International Speech Communication Association (ISCA). Mother of 3🌈
Principal Research Scientist at IBM Research AI in New York. Speech, Formal/Natural Language Processing. Currently LLM post-training, structured SDG and RL. Opinions my own and non stationary.
ramon.astudillo.com
Studying language in biological brains and artificial ones at the Kempner Institute at Harvard University.
www.tuckute.com
Guitarist, Researcher Google DeepMind. Opinions are my own.
Researcher in computer audition, machine learning, and HCI. Sr. Research Scientist, @AdobeResearch. Previously @DescriptApp, @Northwestern.
https://pseeth.github.io/
AI for Music • Research Scientist @ Suno
I created pyannote open source toolkit.
Co-founder and CSO at pyannoteAI
Scientist at CNRS.
https://audio.ls2n.fr
A science game to test your musical memory: https://tunetwins.app
Canadian in NYC (she/her) teaching music and data analysis at Brooklyn College and the Graduate Center, CUNY. Co-Editor-in-Chief of Journal of New Music Research.
Once was speech technologist - Water of Leith, Edinburgh - Born 320.23 ppm
Principal Scientist (Director) at Google DeepMind in Japan. 波瀬小⇒一志中⇒鈴鹿高専⇒名工大 (IBM T.J. Watson Research intern)⇒東芝欧州研究所⇒Google (Speech🇬🇧⇒Brain🇯🇵) ⇒Google DeepMind. 3rd generation Korean in Japan.
ml, audio, cv, nlp, speech, bioacoustics // Assoc. Prof. at Université de Toulon, researcher at LIS CNRS UMR 7020, director of http://www.master-mir.eu in marine robotics and AI
Outlier detection / Kernel methods / Information geometry / Hopfield networks / Dynamical systems
Auditory Signal Processing/Objective Metrics/Hearing Assistive Technologies.
Twitter: @kyama0321
WEB: https://sites.google.com/site/kyama0321/en
1st-year doctoral student @ Univ. Tokyo | audio signal processing, speech synthesis, machine learning
https://trgkpc.github.io/
Linguist in AI & CogSci 🧠👩💻🤖 PhD student @ ILLC, University of Amsterdam
🌐 https://mdhk.net/
🐘 https://scholar.social/@mdhk
🐦 https://twitter.com/mariannedhk
Postdoctoral Researcher @ Inria Montpellier (IROKO, Pl@ntNet)
SSL for plant images
Interested in Computer Vision, Natural Language Processing, Machine Listening, and Biodiversity Monitoring
Website: ilyassmoummad.github.io
AI researcher.
CTO at HANCE, Associate Professor at NTNU.
Compression, generative, audio, time series
Professor at Université de Lorraine/Loria/Mines Nancy. Doing research is speech and audio processing.
Associate professor @ Cornell Tech
AI researcher in music, audio, LLMs.
Research Scientist @ Meta GenAI in NYC.
Working on audio/speech for LLaMA.
Previously: PhD @ JHU CLSP
desh2608.github.io
Reader (Associate Professor), @qmuleecs.bsky.social Queen Mary University of London - research on AI for audio. Website: https://www.seresearch.qmul.ac.uk/cmai/people/ebenetos/
🎧 Machine Listening Researcher
Speech • Language • Learning
https://grzegorz.chrupala.me
@ Tilburg University
Assoc. Professor at UC Berkeley
Artificial and biological intelligence and language
Linguistics Lead at Project CETI 🐳
PI Berkeley Biological and Artificial Language Lab 🗣️
College Principal of Bowles Hall 🏰
https://www.gasperbegus.com
I work on speech and language technologies at Google. I like languages, history, maps, traveling, cycling, and buying way too many books.
Lecturer in speech and language technology, CSTR, University of Edinburgh.
https://homepages.inf.ed.ac.uk/clai/
Speech, language, and deep learning at the Technion. But also psychology, philosophy, and history. And Jazz improv.
Assistant professor at USherbrooke. Creator of the ODAS framework. Research in speech, multichannel audio processing, robot audition, embedded AI.
francoisgrondin.com
Research scientist at #Inria. Audio signal processing, Acoustics, Machine Learning, Bicycle Riding, Lindy Hop Dancing.
Audio AI research engineer w/ Lemonaide. prev. Neutone, Okio. MSc in Audio Computation at UPF. I also fly planes and play the cello sometimes!
Does research on machine learning at Sony AI, Barcelona. Works on audio analysis, synthesis, and retrieval. Likes tennis, music, and wine.
https://serrjoa.github.io/
Postdoctoral researcher at Meta
Researcher at Adobe Research. Machine learning on audio. Screamer. Oaklander born in Barcelona. Titan. He/they 🌈
www.urinieto.com
Lecturer at the University of Edinburgh. Member of Centre of Speech Technology Research (CSTR).
Head of Sound Design AI Research at Adobe. Machine learning and signal processing for audio & video. Musician. He/him.
www.justinsalamon.com
Audio ML Research @ Auto-Tune 🎤🎵
Bay Area SSBM & RL gamer
Love to talk Cognitive Science, Linguistics, Bio-inspired Learning, Topological Signal Processing & TDA
research on llm + music (https://seungheondoh.github.io/).
PhD Candidate @ Music and Audio Computing Lab, KAIST. Previously an intern @Adobe, @BytedanceTalk, @Naver, @Chartmetric.
Efficient+Controllable Audio Generation @ UCSD | Interning Stability AI, Adobe | Teaching drums @ POW Percussion
Musician, Engineer, AI Researcher - @mitofficial.bsky.social @medialab.bsky.social
research in ML at MERL and Mila
francescopaissan.it
Fixed-term researcher (RTDA) @polimi
working on audio signal processing, music informatics, spatial audio and generative models (https://lucacoma.github.io/)
Tenured Assistant Professor at CentraleSupélec.
Signal processing and machine learning for speech and audio.
sleglaive.github.io
computers and music are (still) fun
Postdoctoral Scholar Stanford NLP
interspeech2026.org
27 September – 1 October, ICC, Sydney, Australia
'Speaking Together'
Proudly hosted by the Australasian Speech Science and Technology Association (ASSTA) and the International Speech Communication Association (ISCA).
Challenge on Detection and Classification of Acoustic Scenes and Events.
https://dcase.community/
PhD Student @ Telecom Paris, ADASP team. Previously research scientist intern @ Deezer, Sony CSL (Music Team).
AI/ML for audio and music signal processing and synthesis.
human computer musical instruments
https://hugofloresgarcia.art/
phd candidate @northwestern
research intern @adobe
prev @spotify, @descript
chicago // honduras
Deep Learning, speech recognition, language modeling, https://scholar.google.com/citations?user=qrh5CBEAAAAJ&hl=en
Open source, https://github.com/albertz/
Farming chili peppers for fun and hot sauce 🌶️
Researcher@Meta Reality Labs, working on generative models, speech enhancement, speech recognition, TTS, etc.
https://nateanl.github.io/
Music and artificial intelligence.
Researcher at Stability AI.
Musician at BRNRT Collective.
Previously at Dolby and Universitat Pompeu Fabra.
artintech.substack.com
www.jordipons.me
Assistant Professor at University of Michigan | PhD from UC San Diego | Human-Centered Generative AI for Content Creation
I lead music ML research for Music. Flexitalian.
AudioML research scientist at https://audioshake.ai, before: post-doc @inria@social.numerique.gouv.fr, Editor at https://bsky.app/profile/joss-openjournals.bsky.social
All in 17.68% of grey, located in Frankfurt (Germany)