You can perform a 200ms search over 40 million texts using just a CPU server, 8GB of RAM, and 45GB of disk space.
The trick: binary search with int8 rescoring.
I'll show you a demo & how it works in the 🧵:
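The pattern behind those numbers can be sketched with NumPy on toy data (the real demo uses Sentence Transformers embeddings; the corpus size, dimensionality, and shortlist of 100 below are illustrative). Binary quantization keeps only the sign bit of each embedding, giving a ~32x smaller index that can be searched by Hamming distance; an int8 copy of the corpus then rescores the shortlist:

```python
import numpy as np

rng = np.random.default_rng(0)
docs = rng.normal(size=(10_000, 256)).astype(np.float32)   # toy corpus embeddings
query = rng.normal(size=256).astype(np.float32)

# 1. Binary quantization: keep only the sign bit, a ~32x smaller index.
doc_bits = np.packbits(docs > 0, axis=1)                   # (10_000, 32) uint8
query_bits = np.packbits(query > 0)                        # (32,) uint8

# 2. Coarse search: Hamming distance = popcount(XOR), cheap on CPU.
hamming = np.unpackbits(doc_bits ^ query_bits, axis=1).sum(axis=1)
candidates = np.argsort(hamming)[:100]                     # shortlist of 100

# 3. Rescore the shortlist with an int8 copy of the corpus.
scale = 127.0 / np.abs(docs).max()
docs_i8 = (docs * scale).astype(np.int8)
scores = docs_i8[candidates].astype(np.float32) @ query
best = candidates[np.argsort(-scores)]                     # final ranking
```

The binary index is what keeps RAM tiny; the int8 corpus can stay on disk and only the 100 shortlisted rows are touched per query.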
@sugatoray
VP, Data Scientist @Truist. I am a physicist turned data scientist, deeply passionate about technical topics. I mostly post about Python, Deep Learning, ML, LLMs, MLOps, DevOps, etc. Find more on me at https://linktr.ee/sugatoray. Active on LinkedIn.
Check out this @PyTorch blog showing how to get peak performance using torch.compile with the diffusers library in Python.
Blog: pytorch.org/blog/torch-c... ("torch.compile and Diffusers: A Hands-On Guide to Peak Performance" | PyTorch)
#PyTorch #torchCompile #diffusers #python
➡️ YouTube video: youtu.be/VePxCcF99w4?...
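The blog's core trick is one line: wrap the pipeline's heavy module in torch.compile. Here is a dependency-light sketch on a stand-in module; the eager backend is used only so the example runs anywhere, while the guide itself uses the default inductor backend, and the commented diffusers line is an assumption of how the same call looks on a real pipeline:

```python
import torch

model = torch.nn.Sequential(torch.nn.Linear(16, 16), torch.nn.GELU())

# torch.compile captures the module's graph once, then replays the optimized
# version on later calls. backend="eager" keeps this sketch portable; real
# speedups come from the default "inductor" backend.
compiled = torch.compile(model, backend="eager")

x = torch.randn(4, 16)
with torch.no_grad():
    y_ref, y_opt = model(x), compiled(x)

# On a diffusers pipeline, the same idea would look like (sketch):
#   pipe.unet = torch.compile(pipe.unet, mode="reduce-overhead", fullgraph=True)
```

Compilation does not change the numerics here, only how the graph is executed.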
@vscode.dev @anthropic.com #mcp #vscode #aiagents #tooluse
Note: you can use Ollama as an option for a local, private LLM along with GitHub Copilot. By default, Copilot uses OpenAI models, but you can choose other providers or use Ollama for local LLMs.
Have fun. Time to get creative now!
🎥 Video: lnkd.in/g2KjDysw
Check out the video on how to use tools with Agent Mode in Visual Studio Code (#VSCode).
It also covers:
- Prompt Boost (a VS Code extension to improve your prompts on the fly while you use Copilot)
- How to use MCP
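For the MCP part, VS Code reads server definitions from a `.vscode/mcp.json` file in your workspace; a minimal sketch (the server name and package are placeholders, not a real server):

```json
{
  "servers": {
    "my-mcp-server": {
      "command": "npx",
      "args": ["-y", "@example/mcp-server"]
    }
  }
}
```

Once defined, the server's tools show up in Copilot's Agent Mode tool picker.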
What "googling" was from 2005 to 2025, a must-have skill, will be layered over by the skill of effectively using AI in white-collar jobs by 2030 (or maybe sooner).
But how do you learn such a skill? By applying it and playing with it.
✨ Using AI tools effectively is a skill, and one you don't get taught: a skill you acquire with practice and exposure to the technology.
🎥 Video: youtu.be/VePxCcF99w4
#vscode #LLMs #ai #aiagents #tooluse #MCP #tipsandtricks
🔥 The smolagents module has arrived in the agents course!
- Code agents optimised for software development
- Tool-calling agents that create modular, function-driven workflows
- Retrieval agents designed to access and synthesise information
Course: https://buff.ly/4kcj6Ai
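A flavor of what the course covers, as a sketch: the tool itself is plain Python, while the agent wiring is shown in comments because it needs an API-backed model. The names in the comments follow the smolagents quickstart, but treat them as assumptions:

```python
def get_weather(city: str) -> str:
    """Toy tool: a real agent would call a weather API here."""
    return f"Sunny in {city}"

# Wiring it into an agent (sketch, assuming the smolagents API):
#   from smolagents import CodeAgent, tool, HfApiModel
#   weather = tool(get_weather)
#   agent = CodeAgent(tools=[weather], model=HfApiModel())
#   agent.run("What's the weather in Paris?")
```

The point of the tool-calling style is exactly this modularity: the function is testable on its own, independent of any model.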
hot take: WebRTC should be ONE line of Python code
introducing FastRTC ⚡ from Gradio!
start now: pip install fastrtc
what you get:
- call your AI from a real phone
- automatic voice detection
- works with ANY model
- instant Gradio UI for testing
this changes everything
#Bookmark 🔥 #webrtc #python #pylib #fastrtc
WebRTC with the FastRTC Python library
pip install fastrtc
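Roughly what the one-line pitch means in practice: your handler is an ordinary Python generator, and FastRTC wraps it in a stream with a Gradio UI. The Stream/ReplyOnPause wiring below is commented out and based on the FastRTC quickstart, so treat the exact names as assumptions:

```python
import numpy as np

def echo(audio):
    """FastRTC-style audio handler: takes (sample_rate, samples), yields audio back."""
    sample_rate, samples = audio
    yield sample_rate, samples

# Wiring (sketch):
#   from fastrtc import Stream, ReplyOnPause
#   stream = Stream(ReplyOnPause(echo), modality="audio", mode="send-receive")
#   stream.ui.launch()   # the instant Gradio UI for testing
```

ReplyOnPause is what gives you the "automatic voice detection": it calls your handler only when the speaker pauses.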
I also liked the fact that the installation checks for a successful duckdb install by running "SELECT 42"; if it does not return 42, duckdb was not installed correctly.
I thought I knew well how to optimize Python container size, until I watched this talk from Matthijs Brouns 👇🏼
www.youtube.com/watch?v=Z1Al...
#docker #python
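Common levers from talks like this one, sketched as a multi-stage Dockerfile (the base image tags and paths are illustrative): build wheels in a throwaway stage with the full toolchain, then copy only the installed packages into a slim runtime image.

```dockerfile
# Build stage: install dependencies with the full toolchain available.
FROM python:3.12-slim AS build
COPY requirements.txt .
RUN pip install --no-cache-dir --prefix=/install -r requirements.txt

# Runtime stage: copy only the installed packages, leave the toolchain behind.
FROM python:3.12-slim
COPY --from=build /install /usr/local
COPY app/ /app/
CMD ["python", "/app/main.py"]
```

The --no-cache-dir flag and the two-stage split are the usual first wins before reaching for distroless or alpine tricks.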
That's neat! Install #duckdb on #macOS and #Linux with a simple command.
curl install.duckdb.org | sh
Since I can also see the shell script's code on the site, I could copy it, save it to a local file, and run it myself. However, `curl <url> | sh` does exactly that!
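That `SELECT 42` check is a nice generic pattern: ask the engine a trivial question with a known answer. Sketched here with the standard library's sqlite3 so it runs anywhere; with duckdb's Python package the equivalent query call would be `duckdb.sql("SELECT 42")` (an assumption based on its documented API):

```python
import sqlite3

def install_ok(connect):
    """Return True if the SQL engine answers a trivial known-answer query."""
    con = connect(":memory:")
    (answer,) = con.execute("SELECT 42").fetchone()
    return answer == 42

print(install_ok(sqlite3.connect))   # a healthy install prints True
```

Any wrong answer (or an exception) means the install, not your query, is broken.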
Check out "The Illustrated DeepSeek-R1" by @jayalammar.bsky.social
Blog: open.substack.com/pub/jayalamm...
#ml #LLMs #deepseek
The newest extremely strong embedding model based on ModernBERT-base is out: `cde-small-v2`. Both faster and stronger than its predecessor, this one tops the MTEB leaderboard for its tiny size!
Details in 🧵
New LLM Eval Office Hours: I discuss the importance of doing error analysis before jumping into metrics and tests.
Links to notes in the YT description
youtu.be/ZEvXvyY17Ys?...
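The workflow behind "error analysis before metrics" fits in a few lines: read raw failing traces, annotate each with an error mode, and tally. The traces and category names below are invented for illustration:

```python
from collections import Counter

# Hypothetical annotated traces: (input, output, error mode or None).
traces = [
    ("q1", "a1", None),
    ("q2", "a2", "hallucinated citation"),
    ("q3", "a3", "ignored context"),
    ("q4", "a4", "hallucinated citation"),
]

# Tally the failure modes: the most frequent ones tell you which
# metric or automated test is worth building first.
modes = Counter(m for _, _, m in traces if m is not None)
for mode, n in modes.most_common():
    print(f"{mode}: {n}")
```

The counts, not intuition, then decide which eval to invest in.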
🔥 #OpenAI's o3 model performance makes a leap, setting a new high score on the #ARCAGI benchmark.
Source: arcprize.org/blog/oai-o3-...
#ml #ai #arcagi #benchmark #openai
The @ollama.bsky.social Python library introduced #FunctionCalling in version 0.4.
Ollama is an AI tool that lets you run Large Language Models (LLMs) on-device (local LLMs).
Ollama blog: ollama.com/blog/functio...
#ollama #python #LLMs #ml #ai #localLLM #localai
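Version 0.4's headline feature is passing plain Python functions as tools. The chat call below is commented out since it needs a running Ollama server, and it follows the Ollama blog's example, so treat the exact field names as assumptions:

```python
def add_two_numbers(a: int, b: int) -> int:
    """Add two numbers. Ollama builds the tool schema from the signature and docstring."""
    return a + b

# With a local Ollama server running (sketch):
#   import ollama
#   resp = ollama.chat(
#       model="llama3.1",
#       messages=[{"role": "user", "content": "What is 2 + 3?"}],
#       tools=[add_two_numbers],          # pass the function object directly
#   )
#   for call in resp.message.tool_calls or []:
#       print(add_two_numbers(**call.function.arguments))
```

The nice part is that the tool stays an ordinary, directly testable function.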
An Introduction to Polars 🐻‍❄️
The Polars workshop from the PyData NYC conference is now available online. This great workshop, by Matt Harrison, focuses on the foundations of Polars 👇🏼
www.youtube.com/watch?v=q3o2...
#Python #Data #Polars
If you're interested in embedding models for retrieval (search), clustering, classification, paraphrase mining, etc., there are now 10,000+ fully free and open-source options on @hf.co via Sentence Transformers.
Check out the most popular ones here: huggingface.co/models?libra...
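Whichever of those models you pick, retrieval then reduces to cosine similarity over the encoded vectors. A stand-in sketch: the SentenceTransformer lines are commented out because they download weights, and the model name is just one popular choice:

```python
import numpy as np

# from sentence_transformers import SentenceTransformer
# model = SentenceTransformer("all-MiniLM-L6-v2")   # one of the 10,000+ options
# emb = model.encode(docs + [query])

rng = np.random.default_rng(1)
emb = rng.normal(size=(5, 384))          # stand-in embeddings: 4 docs + 1 query

def cosine_top_k(query_vec, doc_vecs, k=3):
    """Rank documents by cosine similarity to the query."""
    q = query_vec / np.linalg.norm(query_vec)
    d = doc_vecs / np.linalg.norm(doc_vecs, axis=1, keepdims=True)
    return np.argsort(-(d @ q))[:k]

top = cosine_top_k(emb[-1], emb[:-1])
```

Clustering and paraphrase mining build on exactly the same similarity matrix.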
A new interval-lookup library just landed for Rust, C++, and Python, claiming best-in-class performance! Pretty cool: github.com/kcleal/super...
"A Comprehensive Guide to Python Project Management and Packaging Concepts Illustrated with uv"
This Reinforced Knowledge series on Python packaging might be the most comprehensive write-up I've... ever seen?
reinforcedknowledge.com/a-comprehens...
One of the fun things you can do with uvx is to just … run python.
Super useful for playing around with a Python package without having to make a virtualenv somewhere.
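What that looks like in practice, as a sketch (package names here are just examples, and uvx is shorthand for `uv tool run`):

```shell
# Drop into Python with a package available, no virtualenv to manage:
uv run --with rich python -c "from rich import print; print('[bold]hi[/bold]')"

# Run a package's CLI in a cached, throwaway environment:
uvx ruff --version
```

The environment is created on the fly and cached, so the second run is instant.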