Junya Koguchi, Tomoki Koriyama: Voting-based Pitch Estimation with Temporal and Frequential Alignment and Correlation Aware Selection https://arxiv.org/abs/2602.01727 https://arxiv.org/pdf/2602.01727 https://arxiv.org/html/2602.01727
We released the inference code and a model from AESCA (Yamamoto+, #ASRU2025), the top-performing system in AudioMOS Challenge 2025 Track 2, for predicting audio aesthetics scores (AES).
Paper: arxiv.org/abs/2512.05592
Code: github.com/CyberAgentAI...
At the IEEE #ASRU2025, we presented our automatic evaluation system for generated audio, which won first place in the AudioMOS Challenge 2025 Track 2🥇. At the start of the session, an award ceremony was held, and I accepted the certificate on behalf of the team.
Today’s poster presentation at #ASRU2025 🥳
Preprint: arxiv.org/abs/2512.05592
It's the best system in AudioMOS Challenge 2025 Track 2👑
sites.google.com/view/voicemo...
On Dec 9th, 4:00 PM, we will be giving a poster presentation titled “The T12 System for AudioMOS Challenge 2025: Audio Aesthetics Score Prediction System Using KAN- and VERSA-based Models” at ASRU2025 in Honolulu. #ASRU2025
Preprint: arxiv.org/abs/2512.05592
We are attending #ASRU2025 in Honolulu!!!🏝️🌺 The conference center is very close to Waikiki beach 🌊🏄🌈
Thank you for attending my talk. I'm happy to contribute to the special session on spectrotemporal modulation!
eppro02.ativ.me/web/index.ph...
On Dec 3rd, 4:00 PM, I will be giving an invited talk titled "Towards Machine Learning-Driven Speech Intelligibility Prediction Models: Examining Relationships with Spectrotemporal Modulation" at the 6th ASA/ASJ joint meeting in Honolulu🏝️🌺 #ASAASJ25
eppro02.ativ.me//web/index.p...
Had such a great time presenting our tutorial on Interpretability Techniques for Speech Models at #Interspeech2025! 🔍
For anyone looking for an introduction to the topic, we've now uploaded all materials to the website: interpretingdl.github.io/speech-inter...
I finished my presentation. Thank you for attending the session and discussion! #Interspeech2025
🇳🇱🌷🐨🇦🇺
#Interspeech2026KoalaCompetition
#Interspeech2026
#Interspeech2025
Banquet at Stadshaven Brouwerij & Gastropub🍻🎸🥁🎺🎹⛴️ #Interspeech2025
I'll be presenting my paper at #Interspeech2025 :
Area6-Oral6-1330-1 “Non-Intrusive Binaural Speech Intelligibility Prediction Using Mamba for Hearing-Impaired Listeners”
www.isca-archive.org/interspeech_...
#Interspeech2025 opens!!🌷💃🕺🌷
I'm attending #Interspeech2025 in Rotterdam 🇳🇱
😍 Check out the #Interspeech2025 Proceedings!
www.interspeech2025.org/abstract-boo...
Our paper has been accepted for #Interspeech2025
Non-Intrusive Binaural Speech Intelligibility Prediction Using Mamba for Hearing-Impaired Listeners 🦻🐍
See you in Rotterdam🇳🇱
Our team's year-end party was held in Shibuya, Tokyo🍶 My colleagues gave me a wedding gift🎁 Thanks!
Views from the office window. Photo taken just now.
Do you want to work with me for a few months? Two internship positions are available on the Music Team at Sony AI in Barcelona!
👇
Unfortunately, my paper for ICASSP 2025 was rejected🥺 Thanks to the reviewers and AC for the peer review🙏 I will work hard on my next submission, reflecting the useful comments I received on my research.
📣Amazing opportunity for #speech researchers!
Postdoc Position: Computational Modelling of Speech Recognition at the Donders Centre for Cognition, Radboud University, Nijmegen, the Netherlands
More info: www.ru.nl/en/working-a...
👀🦻 > Multi-objective non-intrusive hearing-aid speech assessment model
pubs.aip.org/asa/jasa/art...
🤖👂 > SPS SLTC/AASP TECHNICAL COMMITTEE WEBINAR
Audio Signal Enhancement: A Weakly Supervised Deep Learning Approach
15 January 2025
Presented by Dr. Nobutaka Ito & Dr. Yoshiaki Bando
landing.signalprocessingsociety.org/jan-15-2024
A paper explaining that, to successfully train a CLIP-like contrastive VL model, the alignment between the image and text encoders should be maintained
arxiv.org/abs/2412.04616
👀👂 > OHHR – The Oldenburg Hearing Health Repository [Dataset]
zenodo.org/records/1417...
Donated to arXiv for open science🕊️
🦋🎓👀 > Altmetric introduces Bluesky as a new social media tracking source - Altmetric
www.altmetric.com/altmetric-ne...