@unclecode
Author of Crawl4AI (#1 GitHub Trending, 16K+ stars). Founder of Kidocode, SE Asia's largest tech & biz school. Based in Singapore, running an AI research lab on synthetic data. AI researcher/consultant, systems engineer, musician, coffee enthusiast.
Oh yeah, such a good idea, I didn't even think of it!! Will definitely consider that. Btw, thx for the kind words.
I have to write a short essay on this to convey my message!
In my view, automated coding tools often fail at complex tasks because of a lack of linguistic awareness. Both the developers building these systems and the users relying on them miss the importance of language as the core mechanism driving these models, which limits their effectiveness.
Language models don't truly "understand" function calls; they interpret language. We guide them by embedding functions as part of a linguistic framework. To build better AI tools, developers need to understand linguistics and the philosophy of language, an area that is often overlooked.
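To make that concrete, here is a minimal sketch of how a function reaches a model as text. Everything in it is an assumption for illustration: the `get_weather` tool, the `render_tool_prompt` helper, and the `parse_tool_call` helper are hypothetical and not taken from any real SDK. The point is only that the model reads a name, a description, and parameter notes, and replies with text that happens to be JSON.

```python
import json

# Hypothetical tool definition in JSON Schema style (illustration only,
# not tied to any vendor API).
GET_WEATHER_TOOL = {
    "name": "get_weather",
    "description": "Return the current weather for a city. "
                   "Use this for present conditions, not forecasts.",
    "parameters": {
        "type": "object",
        "properties": {
            "city": {"type": "string",
                     "description": "City name, e.g. 'Singapore'"},
            "unit": {"type": "string",
                     "enum": ["celsius", "fahrenheit"],
                     "description": "Temperature unit to report"},
        },
        "required": ["city"],
    },
}


def render_tool_prompt(tool: dict) -> str:
    """Flatten a tool definition into the plain text a model would read."""
    lines = [
        f"Tool: {tool['name']}",
        f"Purpose: {tool['description']}",
        "Arguments:",
    ]
    required = set(tool["parameters"].get("required", []))
    for name, spec in tool["parameters"]["properties"].items():
        flag = "required" if name in required else "optional"
        lines.append(f"- {name} ({spec['type']}, {flag}): {spec['description']}")
    return "\n".join(lines)


def parse_tool_call(model_output: str) -> tuple[str, dict]:
    """Parse a tool call that the model wrote as JSON text."""
    call = json.loads(model_output)
    return call["name"], call["arguments"]


if __name__ == "__main__":
    print(render_tool_prompt(GET_WEATHER_TOOL))
    # The "call" is just language that the host program chooses to execute.
    print(parse_tool_call(
        '{"name": "get_weather", "arguments": {"city": "Singapore"}}'
    ))
```

In this sketch the only levers are words: rename the tool or blur its description and the model's behavior changes even though the Python behind it stays the same.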
This definitely outshines OpenAI GPTs! You can wrap any tools into a simple server app, pass it to Claude, and that's it. Claude can use them all! It's far more flexible and, with good design, allows for scalability. To me, this is one of the best moves in increasing LLM utility in recent years.
To me, a web crawler mixed with a language model works like a therapist unraveling the web's tangled chaos to uncover hidden truths. It mirrors the cycle of knowledge: extracting what we've built (web), transforming it, and returning it as nourishing rain for the mind (LLM).
Amazing, was thinking of it today, thx for sharing the link
Nostalgic! HSB2 dataset and descriptive stats… such a good time. For me it was like ~15 years ago, but hsb2, I think, was around 2015… not sure, hv to define the confidence interval
We occasionally chat on X, but you don't follow me. Hopefully, we can connect here. I focus on synthetic data generation. I started with Crawl4AI, an open-source crawler and data extraction tool, and I'm planning to continue with a synthetic-data-on-demand pipeline.
How can I DM you? I'd like to join the community.
I hope this new platform will be different, where posts actually reach people who care. I still love X for the friends and connections I've made, but this issue is big for me. Let's see how it goes here, hoping for better!
5/5
This has left me replying to posts from people I like just to stay visible and get exposure. But that's not the only thing I want to do; I want to share my own ideas. It feels disconnected, like I can't reach the audience that matters.
4/5 🧵
My first post about my repo got 90K views and great engagement. Since then, nothing. People say you need to "volume post" (spam posts all day hoping one breaks through). What the hell!! Why should I hack algorithms just to reach relevant people? Is this designed to force us to pay to boost?
3/5 🧵
No matter what I post, even random words, the X algorithm caps my impressions at 200. Meanwhile, low-quality posts easily get thousands of views. When only 200 people see my posts, how can I reach those who actually need my content? There's no way I can accept that only 200 people in the world are relevant to my posts!
2/5 🧵
I would like to explain one of my reasons to try Bluesky, besides @jph.bsky.social's motivation. I have 2,000 followers on Platform X, I'm the author of a top-trending repo, and I'm very active with my GitHub community. But despite that, it's become unbelievably hard to reach people on X.
1/5 🧵
Done, and I connected my main repo: github.com/unclecode/cr...
Hello Sky! My very first post, and there you go @jph.bsky.social: as a result of your inspiration, I became a bird in the blue sky. I set my first milestone at 10k enthusiastic followers in AI, data extraction/engineering, and brewing coffee!