Trending
Rudolf Arseni Braun's Avatar

Rudolf Arseni Braun

@fasttosmile

Here for AI stuff. Currently ASR@AWS Sometimes write on rudolfarseni.me

18
Followers
129
Following
2
Posts
19.11.2024
Joined
Posts Following

Latest posts by Rudolf Arseni Braun @fasttosmile

Maybe it's good to play games because it's a way to get a verifiable reward signal for one's thoughts.

19.03.2025 18:21 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

100% agree on the fly UIs is the future. It will still take a bit for our models to be good enough at generating dense data (i.e. not text) but it will happen.
Is tricky to imagine and reason about the downstream effects, wish I could get a glimpse of the future!

14.12.2024 23:26 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Preview
CTCLoss gradient is incorrect ยท Issue #52241 ยท pytorch/pytorch ๐Ÿ› Bug Hi, While working on some CTC extensions, I noticed that torch's CTCLoss was computing incorrect gradient. At least when using CPU (I have not tested on GPU yet). I observed this problem on b...

I just learned that Torch ctc_loss calculates the wrong gradient (but when there was log_softmax before, it does not matter).

For the grad ctc_loss w.r.t. log_probs, it calculates exp(log_probs) - y, but correct would be -y. Some workaround: github.com/pytorch/pyto...

PS: First Bluesky post.

26.11.2024 23:16 ๐Ÿ‘ 10 ๐Ÿ” 2 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Observing the responses here and on twitter made me reflect, realize and act. There is a real difference, and I describe it (as well as some of the non-differences) here:

gist.github.com/yoavg/9142e5...

23.11.2024 22:35 ๐Ÿ‘ 12 ๐Ÿ” 4 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 1