Rogier van Dalen

@rogiercvd

Researcher in machine learning (speech recognition / private federated learning) in Cambridge

Followers: 44
Following: 83
Posts: 2
Joined: 18.11.2024

Latest posts by Rogier van Dalen @rogiercvd

BLOG | Samsung Research Globally Normalizing the Transducer for Streaming Speech Recognition

There is now a blog post explaining how to fix the mathematics of streaming speech recognisers. research.samsung.com/blog/Globall...

16.04.2025 14:57 👍 0 🔁 0 💬 0 📌 0
Globally Normalizing the Transducer for Streaming Speech Recognition
The Transducer (e.g. RNN-Transducer or Conformer-Transducer) generates an output label sequence as it traverses the input sequence. It is straightforward to use in streaming mode, where it generates p...

Your streaming speech recognizer is probably mathematically flawed, degrading its accuracy. Ask me to explain how to fix this next week in the Thursday morning poster session at #ICASSP, or look at ieeexplore.ieee.org/abstract/doc...

01.04.2025 15:47 👍 4 🔁 1 💬 1 📌 0
ALT: a person is using a Bosch drill to drill the F5 and F6 keys

#ICASSP2025

19.12.2024 11:08 👍 3 🔁 1 💬 0 📌 0