Ashvanth.S's Avatar

Ashvanth.S

@ashvanths

Deep Learning Practitioner | Language Lead for Tamil @ HuggingFace | Interested in Continual Learning and Generative Models | Website : https://ash-01xor.github.io/ X : https://twitter.com/ashvanth_s1

63
Followers
84
Following
61
Posts
26.11.2024
Joined
Posts Following

Latest posts by Ashvanth.S @ashvanths

Preview
In pursuit of In the span of ten seconds, a fleeting moment, lives can be transformed forever. 100 Meters (Hyakuemu), produced by Rock 'n' Roll Mountain, is a film based u...

"Code appears. It looks right. The diff gets approved. But no one held it in their head. No one walked through the house; they just glanced at a photograph and called it home."
Wrote about coding, Sisyphus, and building where the line keeps moving.

link: asherr.bearblog.dev/in-pursuit-of/

26.01.2026 12:54 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image Post image

Have you ever felt like you lost your focus while reading a book and wandered into deep internet rabbit holes?

Introducing sollu : AI-powered dictionary. Uses the Gemini model under the hood. It is open-sourced as well :).

11.05.2025 16:23 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Quite a humbling experience every day while coding. You start with an issue and a vision about how to solve the problem and then pretty much the road traveled often to reach the solution isn't straightforward.

Humbled each and every day to understand and accept and that it is how it is.

26.02.2025 14:09 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Pretty similar to how Jio first gained share of the internet users in India. Interesting to note big companies have the ability to shell out too much to develop and operate to gain market share. Only time shall tell what this will lead to

26.02.2025 02:40 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Ahh finally a blog post from you , it is quite difficult to maintain a site right like publishing frequent posts

19.02.2025 12:18 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Over a period of time , getting to realize that im having my flow states during certain periods of time and getting to schedule tasks around it.
Guess the goal is to build systems that can make sure we enter such states like on and off button.

19.02.2025 12:17 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Not able to point of the difference particularly , but gpt-4o-mini seems to work way too fast over the last day. From taking around 4 to 5 mins to process a 65-page PDF for extraction, it takes around 3 mins.

Do you guys want me to run benchmark tests and probably write a blog post about it ?

13.02.2025 10:59 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Looking forward to the next unit of the Agents course and building more @benburtenshaw.bsky.social @hf.co

13.02.2025 04:29 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Preview
Understanding Reasoning LLMs Methods and Strategies for Building and Refining Reasoning Models

I just finished writing up my take on reasoning models: magazine.sebastianraschka.com/p/understand...
Here, I
1. Discuss the advantages & disadvantages of reasoning models
2. Of course, describe and discuss DeepSeek R1
3. Describe the 4 main ways to building & improving reasoning models

05.02.2025 13:46 ๐Ÿ‘ 93 ๐Ÿ” 21 ๐Ÿ’ฌ 3 ๐Ÿ“Œ 1
GitHub - ash-01xor/Rebuild-LLM: Building Large language model from scratch Building Large language model from scratch. Contribute to ash-01xor/Rebuild-LLM development by creating an account on GitHub.

Slowly building it one at a time. Thanks to @sebastianraschka.com for his book. Implementing things from scratch takes a lot of time , but valuable experience.
github.com/ash-01xor/Re...

03.02.2025 17:51 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Building SmolGPT myself , have plans to extend it. but before that struggling with managing python versions !!!

Had to use pyenv and then pip. like now i get why experienced devs are frustrated with python package management

03.02.2025 13:14 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Updated my site after quite a long time also added a note for how to update your arch linux system. Do check it out if you use arch or if you like to as well :)

02.02.2025 14:29 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Only a few more annotations are needed to complete the initial goal. for Tamil.
Do join the initiative alongside me , your contribution is highly valuble

07.01.2025 12:03 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Interesting to see the hype of agents and using them , but almost everyone who uses the term throws it away just like that.
All I get to see is a clearly well-defined workflow in a constrained environments most of the time and yet they are being called 'agents'.

07.01.2025 12:00 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Got to find this today only in Python when i made a typo by mistake.
How does the for loop work when i present the number inside range like that ??

05.01.2025 17:09 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Happy new year sebastian !! was waiting for the post

01.01.2025 14:15 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Preview
tam - เฎคเฎฎเฎฟเฎดเฏ - Tamil Join and contribute to the dataset tam - เฎคเฎฎเฎฟเฎดเฏ - Tamil

Well, we are halfway through our initial goal of the Fineweb-C sprint for Tamil. Hopefully I would love to complete the initial goal of annotating 1000 texts within the next two days

Do join if you would like to contribute!

data-is-better-together-fineweb-c.hf.space/share-your-p...

30.12.2024 02:51 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Since being used to python development from the start i dont think i never had an issue using pyenv , venv , conda etc. Like it never felt like a chore. But then hearing about devs from other communities really does make me question why .

23.12.2024 14:50 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

got to read that alec radford left open ai , like what is even happening at open ai

20.12.2024 13:38 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

which in itself is based on the success of their previous films.
As risks taken decreases due to a formulaic process , so does the excitement and the curiosity.

17.12.2024 12:45 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

the big names present in the resume is overlooked as a factor of judgement for their talent or in making films where rather than the concept or story , the focus shifts to the kind of artists brought in to play the characters , their star power and influence to bring audience to theaters ...

17.12.2024 12:45 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

Somehow deep down i always get to think about how optimization of any process leads to boredom over a period of time. The excitement and the risks once taken might decreases due to the numbers the clouds our judgement.

Like while recruiting , where folks are given standard questions to solve or ..

17.12.2024 12:45 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

Big thanks to @dvilasuero.hf.co , @nataliaelv.hf.co and team ๐Ÿ™Œ. Would love to see more people join this effort

14.12.2024 07:47 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Preview
tam - เฎคเฎฎเฎฟเฎดเฏ - Tamil Join and contribute to the dataset tam - เฎคเฎฎเฎฟเฎดเฏ - Tamil

Well, around 10 percent of the initial goal is complete, and so far, it's been quite a one-man army effort. We're still in the hunt for more people to join and contribute to this open-source initiative.

@hf.co

data-is-better-together-fineweb-c.hf.space/share-your-p...

14.12.2024 07:33 ๐Ÿ‘ 4 ๐Ÿ” 1 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Preview
tam - เฎคเฎฎเฎฟเฎดเฏ - Tamil Join and contribute to the dataset tam - เฎคเฎฎเฎฟเฎดเฏ - Tamil

The process has just begun, and we are actively seeking collaborators for Tamil. Join us in this open-source initiative!

Building better models demands a better annotation process, and we are deeply committed to achieving this together

data-is-better-together-fineweb-c.hf.space/share-your-p...

13.12.2024 07:38 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

While these are like the summary of what he considers to be the trends going on right now , interesting to note how it might span out in the future.

Looking forward to building now !

12.12.2024 16:45 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

- Enterprise Search: Integrating LLMs with search capabilities empowers intelligent assistants to manage vast knowledge bases effectively.

- Assistant Applications: These solutions improve workflows by providing accurate, context-aware information.

12.12.2024 16:45 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

- Support Customization: Adaptation of models to domain-specific data for optimal performance.

Trend 3: The Convergence of LLMs and Search
Large language models (LLMs) and search are increasingly intertwined, revolutionizing information retrieval:
....

12.12.2024 16:45 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

Trend 2 : Platform Choice matters
The right platform can determine the success of AI initiatives. Enterprises benefit from platforms that:
- Provide Pretrained Models : Easy access to SOTA models.
- Enable Production Management: Seamless monitoring and scaling in real-world deployments....

12.12.2024 16:45 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

Democratization: AI tools are increasingly accessible, enabling more people to develop AI without extensive resources.

Generalization Across Tasks: The shift towards universal models capable of performing millions of tasks replaces the need for task-specific models....

12.12.2024 16:45 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0