Trending
Shahroz Butt's Avatar

Shahroz Butt

@shahrozbutt

BS student @UAF | exploring deep learning

140
Followers
1,079
Following
2
Posts
23.11.2024
Joined
Posts Following

Latest posts by Shahroz Butt @shahrozbutt

Video thumbnail

Automatic differentiation in forward mode computes derivatives by breaking down functions into elem operations and propagating derivatives alongside values. It’s efficient for functions with fewer inputs than outputs and for Jacobian-vect prod, using for instance dual numbers.

13.12.2024 06:00 πŸ‘ 37 πŸ” 10 πŸ’¬ 2 πŸ“Œ 0
A figure from the attached paper showing the difference in output between a benchmark model, and one with the super weight removed. The benchmark model generates a reasonable answer, the one where the weight is missing generates complete gibberish

A figure from the attached paper showing the difference in output between a benchmark model, and one with the super weight removed. The benchmark model generates a reasonable answer, the one where the weight is missing generates complete gibberish

#ai, #ml or #llm people here, what do you think about the β€œsuper weight” paper?

TLDR: deleting one single weight from a 7B model turns it completely incoherent, destroying it’s ability to generate legible text.

arxiv.org/pdf/2411.07191

01.12.2024 07:05 πŸ‘ 33 πŸ” 7 πŸ’¬ 3 πŸ“Œ 0

Add me please

28.11.2024 03:51 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

What are these starter packs? What are the requirements to get in?

28.11.2024 02:41 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
a panda bear is rolling around in the grass in a zoo enclosure . Alt: a panda bear is rolling around in the grass in a zoo enclosure .

No one can explain stochastic gradient descent better than this panda.

24.11.2024 15:04 πŸ‘ 216 πŸ” 32 πŸ’¬ 10 πŸ“Œ 6

I noticed a lot of starter packs skewed towards faculty/industry, so I made one of just NLP & ML students: go.bsky.app/vju2ux

Students do different research, go on the job market, and recruit other students. Ping me and I'll add you!

23.11.2024 19:54 πŸ‘ 176 πŸ” 54 πŸ’¬ 101 πŸ“Œ 4