Soham Deshmukh (@soham97) Following

ethan manilow @ethanmanilow

universal musical approximator. research scientist at gorgle derpmind, magenta team. https://ethman.github.io

IEEE WASPAA 2025 @waspaa.com

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA). WASPAA 2025 will be held Oct. 12-15, 2025 at Granlibakken Tahoe, Tahoe City, CA, USA. Abstract deadline: April 23, 2025 (23:59 AOE). Paper deadline: April 30, 2025 (23:59

@drjohnhershey

@siddhant-arora

Kyutai @kyutai-labs

https://kyutai.org/ Open-Science AI Research Lab based in Paris

Laurent Besacier @lbesacier

Principal Scientist at Naver Labs Europe && Professor at University Grenoble Alpes #NLP #AI #LLMs

Graham Neubig @gneubig

Associate professor at CMU, studying natural language processing and machine learning. Co-founder All Hands AI

NeurIPS Conference @neuripsconf

San Diego Dec 2-7, 25 and Mexico City Nov 30-Dec 5, 25. Comments to this account are not monitored. Please send feedback to townhall@neurips.cc.

Pablo Samuel Castro @pcastr

Señor swesearcher @ Google DeepMind, adjunct prof at Université de Montréal and Mila. Musician. From 🇪🇨 living in 🇨🇦. https://psc-g.github.io/

Chip Huyen @chiphuyen

AI x storytelling AI Engineering: https://amazon.com/dp/1098166302 Designing ML Systems: http://amazon.com/dp/1098107969 @chipro

Jeremy Howard @howard.fm

https://Answer.AI & https://fast.ai founding CEO; previous: hon professor @ UQ; leader of masks4all; founding CEO Enlitic; founding president Kaggle; various other stuff…

Dr. Fei-Fei Li @drfeifei

Prof (CS @Stanford), Co-Director @StanfordHAI, Cofounder/CEO @theworldlabs, CoFounder @ai4allorg #AI #computervision #robotics #AI-healthcare

Devi Parikh @deviparikh

Co-CEO, Yutori. Try out Scouts at yutori.com.

Lucas Beyer (bl16) @giffmana.ai

Researcher (OpenAI. Ex: DeepMind, Brain, RWTH Aachen), Gamer, Hacker, Belgian. Anon feedback: https://admonymous.co/giffmana 📍 Zürich, Suisse 🔗 http://lucasb.eyer.be

karpathy @karpathy

AI @ OpenAI, Tesla, Stanford

Durk Kingma @dpkingma

Research scientist at Anthropic. Prev. Google Brain/DeepMind, founding team OpenAI. Computer scientist; inventor of the VAE, Adam optimizer, and other methods. ML PhD. Website: dpkingma.com

Gowthami Somepalli @gowthami

PhD-ing at UMD. Knows a little about multimodal generative models. Check out my website to know more - https://somepago.github.io/

Phillip Isola @phillipisola

Associate Professor in EECS at MIT. Neural nets, generative models, representation learning, computer vision, robotics, cog sci, AI. https://web.mit.edu/phillipi/

Sander Dieleman @sedielem

Blog: https://sander.ai/ 🐦: https://x.com/sedielem Research Scientist at Google DeepMind (WaveNet, Imagen 3, Veo, ...). I tweet about deep learning (research + software), music, generative models (personal account).

Ilaria Manco @ilariamanco

Research scientist at Google DeepMind working on music • DJ 🎶 https://ilariamanco.com/

Hilde Kuehne @hildekuehne

Professor for CS at the Tuebingen AI Center and affiliated Professor at MIT-IBM Watson AI lab - Multimodal learning and video understanding - GC for ICCV 2025 - https://hildekuehne.github.io/

Jia-Bin Huang @jbhuang0604

Associate Professor at UMD CS. YouTube: https://youtube.com/@jbhuang0604 Interested in how computers can learn and see.

Yifan Peng @pengyf

PhD Student @CMU, Speech AI Research https://pyf98.github.io/

Language Technologies Institute | CMU @ltiatcmu

The Language Technologies Institute in Carnegie Mellon University's @scsatcmu.bsky.social lti.cmu.edu

Jiatong Shi @jiatongs

Speech people @CMU

Piotr Żelasko @pzelasko

AI + Speech @ Nvidia. PhD @ AGH-UST, ex-JHU. My interests: speech processing technologies; ML/AI software engineering. Building OSS for Speech AI.

WAVLab@CMU @wavlab

Shinji Watanabe's Audio and Voice Lab | WAVLab @LTIatCMU @SCSatCMU | Speech Recognition, Speech Enhancement, Spoken Language Understanding, and more.

arXiv Sound @arxiv-sound

Automated posting of sound-related articles uploaded to arxiv.org (eess.AS + cs.SD) Source: https://github.com/dsuedholt/bsky-paperbot-sound/ Inspired by @paperposterbot.bsky.social and https://twitter.com/ArxivSound

Jonathan Le Roux @jonathanleroux

Speech and audio research scientist @MERL. saneworkshop.org co-founder. IguanaTex developer. 🌐 jonathanleroux.org 🐙 github.com/Jonathan-LeRoux/ 🎓 scholar.google.com/citations?user=aUpxty8AAAAJ&hl=en

Eric Fosler-Lussier @ericfos

Professor/Admin @ Ohio State. All opinions expressed on this channel are my personal opinions and do not represent that of my employer.

Odette Scharenborg @odettes

Full professor of inclusive speech communication at TU Delft, The Netherlands. Former president of the International Speech Communication Association (ISCA). Mother of 3🌈

Ramon Astudillo @ramon-astudillo

Principal Research Scientist at IBM Research AI in New York. Speech, Formal/Natural Language Processing. Currently LLM post-training, structured SDG and RL. Opinions my own and non stationary. ramon.astudillo.com

Greta Tuckute @gretatuckute

Studying language in biological brains and artificial ones at the Kempner Institute at Harvard University. www.tuckute.com

Jesse Engel @jesseengel

Guitarist, Researcher Google DeepMind. Opinions are my own.

@markbcartwright

Prem Seetharaman @pseeth

Researcher in computer audition, machine learning, and HCI. Sr. Research Scientist, @AdobeResearch. Previously @DescriptApp, @Northwestern. https://pseeth.github.io/

Christian Steinmetz @csteinmetz1

AI for Music • Research Scientist @ Suno

Hervé Bredin (a.k.a. the pyannote guy) @hbredin

I created pyannote open source toolkit. Co-founder and CSO at pyannoteAI

Keisuke Imoto @keisukeimoto

Vincent Lostanlen @lostanlen

Scientist at CNRS. https://audio.ls2n.fr A science game to test your musical memory: https://tunetwins.app

Johanna Devaney @jcdevaney

Canadian in NYC (she/her) teaching music and data analysis at Brooklyn College and the Graduate Center, CUNY. Co-Editor-in-Chief of Journal of New Music Research.

Steve Renals @srenals

Once was speech technologist - Water of Leith, Edinburgh - Born 320.23 ppm

Heiga Zen (全炳河) @heigazen

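Principal Scientist (Director) at Google DeepMind in Japan. Haze Elementary School ⇒ Ichishi Junior High School ⇒ NIT Suzuka College ⇒ Nagoya Institute of Technology (IBM T.J. Watson Research intern) ⇒ Toshiba Research Europe ⇒ Google (Speech🇬🇧⇒Brain🇯🇵) ⇒ Google DeepMind. 3rd generation Korean in Japan.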

Ricard Marxer @ricard

ml, audio, cv, nlp, speech, bioacoustics // Assoc. Prof. at Université de Toulon, researcher at LIS CNRS UMR 7020, director of http://www.master-mir.eu in marine robotics and AI

@imotts

KU←田辺坂←R←SOKENDAI←Telecom company N

🐿🦋 @sythonuk

🐿earcher

matt @ballforest

Outlier detection / Kernel methods / Information geometry / Hopfield networks / Dynamical systems

yamakatz @kyama0321

Auditory Signal Processing/Objective Metrics/Hearing Assistive Technologies. Twitter: @kyama0321 WEB: https://sites.google.com/site/kyama0321/en

Muramasa @muramasa2

I do research on speech.

Kentaro Seki @trgkpc

1st-year doctoral student @ Univ. Tokyo | audio signal processing, speech synthesis, machine learning https://trgkpc.github.io/

Marianne de Heer Kloots @mdhk.net

Linguist in AI & CogSci 🧠👩‍💻🤖 PhD student @ ILLC, University of Amsterdam 🌐 https://mdhk.net/ 🐘 https://scholar.social/@mdhk 🐦 https://twitter.com/mariannedhk

Ilyass Moummad @ilyassmoummad

Postdoctoral Researcher @ Inria Montpellier (IROKO, Pl@ntNet) SSL for plant images Interested in Computer Vision, Natural Language Processing, Machine Listening, and Biodiversity Monitoring Website: ilyassmoummad.github.io

ErlendA @froskekongen

AI researcher. CTO at HANCE, Associate Professor at NTNU. Compression, generative, audio, time series

Romain Serizel @rserizel

Professor at Université de Lorraine/Loria/Mines Nancy. Doing research in speech and audio processing.

Andrew Owens @andrewowens

Associate professor @ Cornell Tech

@keunwoochoi

AI researcher in music, audio, LLMs.

Desh Raj @rdesh26

Research Scientist @ Meta GenAI in NYC. Working on audio/speech for LLaMA. Previously: PhD @ JHU CLSP desh2608.github.io

Emmanouil Benetos @emmanouilb

Reader (Associate Professor), @qmuleecs.bsky.social Queen Mary University of London - research on AI for audio. Website: https://www.seresearch.qmul.ac.uk/cmai/people/ebenetos/

Michel Olvera @michelolzam

🎧 Machine Listening Researcher

Grzegorz Chrupała @grzegorz.chrupala.me

Speech • Language • Learning https://grzegorz.chrupala.me @ Tilburg University

Gasper Begus @begus

Assoc. Professor at UC Berkeley Artificial and biological intelligence and language Linguistics Lead at Project CETI 🐳 PI Berkeley Biological and Artificial Language Lab 🗣️ College Principal of Bowles Hall 🏰 https://www.gasperbegus.com

Daan van Esch @daanvanesch.nl

I work on speech and language technologies at Google. I like languages, history, maps, traveling, cycling, and buying way too many books.

Catherine Lai @catlai

Lecturer in speech and language technology, CSTR, University of Edinburgh. https://homepages.inf.ed.ac.uk/clai/

Yossi Keshet @keshet

Speech, language, and deep learning at the Technion. But also psychology, philosophy, and history. And Jazz improv.

François Grondin @francoisgrondin

Assistant professor at USherbrooke. Creator of the ODAS framework. Research in speech, multichannel audio processing, robot audition, embedded AI. francoisgrondin.com

@naoyukikandaslp

Antoine Deleforge @adeleforge

Research scientist at #Inria. Audio signal processing, Acoustics, Machine Learning, Bicycle Riding, Lindy Hop Dancing.

Julian Lenz @jlenzyy

Audio AI research engineer w/ Lemonaide. prev. Neutone, Okio. MSc in Audio Computation at UPF. I also fly planes and play the cello sometimes!

Joan Serrà @serrjoa

Does research on machine learning at Sony AI, Barcelona. Works on audio analysis, synthesis, and retrieval. Likes tennis, music, and wine. https://serrjoa.github.io/

Julius Richter @julius-richter

Postdoctoral researcher at Meta

Oriol (Uri) Nieto @urinieto

Researcher at Adobe Research. Machine learning on audio. Screamer. Oaklander born in Barcelona. Titan. He/they 🌈 www.urinieto.com

Hao Tang @larryniven4

Lecturer at the University of Edinburgh. Member of Centre of Speech Technology Research (CSTR).

Justin Salamon @justinsalamon

Head of Sound Design AI Research at Adobe. Machine learning and signal processing for audio & video. Musician. He/him. www.justinsalamon.com

Fernando Espinosa Iñiguez @neuralvocoder

Audio ML Research @ Auto-Tune 🎤🎵 Bay Area SSBM & RL gamer Love to talk Cognitive Science, Linguistics, Bio-inspired Learning, Topological Signal Processing & TDA

SeungHeon Doh @seungheon-doh

research on llm + music (https://seungheondoh.github.io/). PhD Candidate @ Music and Audio Computing Lab, KAIST. Previously an intern @Adobe, @BytedanceTalk, @Naver, @Chartmetric.

Zachary Novack @zacknovack

Efficient+Controllable Audio Generation @ UCSD | Interning Stability AI, Adobe | Teaching drums @ POW Percussion

Lancelot @lancelotblanchard

Musician, Engineer, AI Researcher - @mitofficial.bsky.social @medialab.bsky.social

@yoshiki-masuyama

@gtzan

Francesco Paissan @fpaissan

research in ML at MERL and Mila francescopaissan.it

Luca Comanducci @lucacomanducci

Fixed-term researcher (RTDA) @polimi working on audio signal processing, music informatics, spatial audio and generative models (https://lucacoma.github.io/)

Simon Leglaive @sleglaive

Tenured Assistant Professor at CentraleSupélec. Signal processing and machine learning for speech and audio. sleglaive.github.io

@kzmolikova

Kyle Kastner @kastnerkyle

computers and music are (still) fun

Martijn Bartelds @mbartelds

Postdoctoral Scholar Stanford NLP

Interspeech 2026 @interspeech

interspeech2026.org 27 September – 1 October, ICC, Sydney, Australia 'Speaking Together' Proudly hosted by the Australasian Speech Science and Technology Association (ASSTA) and the International Speech Communication Association (ISCA).

DCASE Challenge @dcase-challenge

Challenge on Detection and Classification of Acoustic Scenes and Events. https://dcase.community/

Bernardo Torres @bernardo-torres

PhD Student @ Telecom Paris, ADASP team. Previously research scientist intern @ Deezer, Sony CSL (Music Team). AI/ML for audio and music signal processing and synthesis.

hugofloresgarcía @hugofloresgarcia

human computer musical instruments https://hugofloresgarcia.art/ phd candidate @northwestern research intern @adobe prev @spotify, @descript chicago // honduras

Albert Zeyer @albertzeyer

Deep Learning, speech recognition, language modeling, https://scholar.google.com/citations?user=qrh5CBEAAAAJ&hl=en Open source, https://github.com/albertz/

robinsch @fakufaku

Farming chili peppers for fun and hot sauce 🌶️

Zhaoheng Ni @nateanl

Researcher@Meta Reality Labs, working on generative models, speech enhancement, speech recognition, TTS, etc. https://nateanl.github.io/

Jordi Pons @jordiponsdotme

Music and artificial intelligence. Researcher at Stability AI. Musician at BRNRT Collective. Previously at Dolby and Universitat Pompeu Fabra. artintech.substack.com www.jordipons.me

Hao-Wen (Herman) Dong 董皓文 @hermandong

Assistant Professor at University of Michigan | PhD from UC San Diego | Human-Centered Generative AI for Content Creation

Yoshiaki Bando @yoshipon0520

Matthias Mauch @matthiasmauch

I lead music ML research for Music. Flexitalian.

Faro Stöter @faroit

AudioML research scientist at https://audioshake.ai, before: post-doc @inria@social.numerique.gouv.fr, Editor at https://bsky.app/profile/joss-openjournals.bsky.social All in 17.68% of grey, located in Frankfurt (Germany)

Following (97)