Pete Cheslock's Avatar

Pete Cheslock

@petecheslock.com

๐Ÿฅฉ He/Him ๐Ÿ– "Anything worth doing is worth overdoing."

816
Followers
60
Following
24
Posts
27.04.2023
Joined
Posts Following

Latest posts by Pete Cheslock @petecheslock.com

๐Ÿ“ข ๐—ง๐—ต๐—ฒ ๐—ฆ๐˜๐—ฎ๐˜๐—ฒ ๐—ผ๐—ณ ๐— ๐—ผ๐—ฑ๐—ฒ๐—น ๐—ฆ๐—ฒ๐—ฟ๐˜ƒ๐—ถ๐—ป๐—ด ๐—–๐—ผ๐—บ๐—บ๐˜‚๐—ป๐—ถ๐˜๐—ถ๐—ฒ๐˜€: ๐— ๐—ฎ๐—ฟ๐—ฐ๐—ต ๐—˜๐—ฑ๐—ถ๐˜๐—ถ๐—ผ๐—ป ๐—ถ๐˜€ ๐—ผ๐˜‚๐˜!

We launched our newsletter publicly last year to share our contributions to upstream communities from our Red Hat AI teams. Weโ€™ve gained over ๐Ÿญ๐Ÿฏ๐Ÿฌ๐Ÿฌ ๐˜€๐˜‚๐—ฏ๐˜€๐—ฐ๐—ฟ๐—ถ๐—ฏ๐—ฒ๐—ฟ๐˜€!

09.03.2026 18:55 ๐Ÿ‘ 2 ๐Ÿ” 2 ๐Ÿ’ฌ 2 ๐Ÿ“Œ 0

Jayson Tatum looking like Jayson Tatum is a horrifying development for the rest of the East.

08.03.2026 17:26 ๐Ÿ‘ 53 ๐Ÿ” 3 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 2

I'm going to be in NYC next week, come and join me at the first llm-d meetup.

If you're looking to learn more about distributed inferencing on kubernetes, this is going to be the place to be.

02.03.2026 16:01 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Optimizing LLM Workloads: A Deep Dive into the GPU Recommendation Tool & Configuration Explorer
Optimizing LLM Workloads: A Deep Dive into the GPU Recommendation Tool & Configuration Explorer YouTube video by llm-d Project

In the latest llm-d release, weโ€™re tackling high hardware costs with the new GPU Recommendation Tool! ๐Ÿ“ˆ

Evaluate throughput, latency, and cost-effectiveness before requesting expensive cluster resources.

Check out the full demo: www.youtube.com/watch?v=Y26i...

24.02.2026 19:17 ๐Ÿ‘ 2 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Come and join us for the first llm-d meetup in NYC!

16.02.2026 17:14 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Preview
Distributed Inference Meetup NYC ยท Luma llm-d Distributed Inference Meetup NYC Hosted by Red Hat AI, IBM Research, and AMD, this event takes place on March 11, 2026 in New York City. What toโ€ฆ

The agenda is still evolving, and weโ€™ve got even more awesomeness in the works! ๐Ÿ“ˆ

Whether you're running GenAI in production or building the platforms to support it, this is the room to be in.

๐Ÿ“… March 11 | 4:30 PM
๐Ÿ“ 1 Madison Ave, NYC
๐ŸŽŸ๏ธ RSVP: luma.com/0crwqwg4

16.02.2026 17:13 ๐Ÿ‘ 0 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 1
[Announcement] WG Serving Has Succeeded and Will Be Disbanded

We'd like to announce that @kubernetes.io WG Serving has succeeded and will be disbanded! Thank you everyone who have participated and contributed to the discussions and initiatives!

More details: groups.google.com/a/kubernetes...

13.02.2026 15:28 ๐Ÿ‘ 4 ๐Ÿ” 2 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 1

The most cursed venn diagram

09.02.2026 17:59 ๐Ÿ‘ 5 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

In case you missed it, last week the llm-d community shipped the v0.5 release.

Check out the post from the llm-d project owners to learn more about all the features we've included in this release.

llm-d.ai/blog/llm-d-v...

09.02.2026 17:52 ๐Ÿ‘ 1 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

๐Ÿ“ข ๐—ง๐—ต๐—ฒ ๐—ฆ๐˜๐—ฎ๐˜๐—ฒ ๐—ผ๐—ณ ๐— ๐—ผ๐—ฑ๐—ฒ๐—น ๐—ฆ๐—ฒ๐—ฟ๐˜ƒ๐—ถ๐—ป๐—ด ๐—–๐—ผ๐—บ๐—บ๐˜‚๐—ป๐—ถ๐˜๐—ถ๐—ฒ๐˜€: ๐—™๐—ฒ๐—ฏ๐—ฟ๐˜‚๐—ฎ๐—ฟ๐˜† ๐—˜๐—ฑ๐—ถ๐˜๐—ถ๐—ผ๐—ป ๐—ถ๐˜€ ๐—ผ๐˜‚๐˜!

We launched our newsletter publicly last year to share our contributions to upstream communities from our Red Hat AI teams. Weโ€™ve gained over ๐Ÿญ๐Ÿฎ๐Ÿฌ๐Ÿฌ ๐˜€๐˜‚๐—ฏ๐˜€๐—ฐ๐—ฟ๐—ถ๐—ฏ๐—ฒ๐—ฟ๐˜€!

09.02.2026 14:46 ๐Ÿ‘ 1 ๐Ÿ” 1 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
llm-d 0.5: Sustaining Performance at Scale | llm-d Announcing the llm-d 0.5 release

๐Ÿ—๏ธ llm-d v0.5: Sustaining Performance at Scale In our last release, we focused on breaking latency records.

With v0.5, weโ€™re shifting from peak performance to the operational rigor required to sustain those gains in production.

๐Ÿงต๐Ÿ‘‡

llm-d.ai/blog/llm-d-v...

05.02.2026 15:32 ๐Ÿ‘ 1 ๐Ÿ” 1 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 1
Preview
Inside vLLMโ€™s New KV Offloading Connector: Smarter Memory Transfer for Maximizing Inference Throughput In this post, we will describe the new KV cache offloading feature that was introduced in vLLM 0.11.0. We will focus on offloading to CPU memory (DRAM) and its benefits to improving overall inferenceโ€ฆ

Standardizing high-performance inference requires deep ecosystem collaboration. ๐Ÿš€

Huge shoutout to @vllm_project and @IBMResearch on the new KV Offloading Connector. Weโ€™re seeing up to 9x throughput gains on H100s and massive TTFT reductions. ๐Ÿงต

blog.vllm.ai/2026/01/08/k...

09.01.2026 18:45 ๐Ÿ‘ 0 ๐Ÿ” 1 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
LLMโ€‘D Explained: Building Nextโ€‘Gen AI with LLMs, RAG & Kubernetes
LLMโ€‘D Explained: Building Nextโ€‘Gen AI with LLMs, RAG & Kubernetes YouTube video by IBM Technology

AI inference is like a busy airport: without a controller, you get gridlock. โœˆ๏ธ

Check out this breakdown by Cedric Clyburn from Red Hat on how llm-d intelligently routes distributed LLM requests.

๐Ÿ”น Solves "round robin" congestion
๐Ÿ”น Disaggregates P/D to save costs

www.youtube.com/watch?v=CNKG...

08.01.2026 19:21 ๐Ÿ‘ 1 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

If you stop to think about it, Geysers are just Earth farts.

25.04.2025 21:13 ๐Ÿ‘ 2 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

I'm unaware of any alternative pronunciations for it.

25.04.2025 21:09 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

We're so young and full of life!

16.03.2025 23:39 ๐Ÿ‘ 7 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
GAGGIUINO Gaggiuino is a community-driven project to add profiling, temp control, and other high-end features to Gaggia espresso machines

Honestly, if you want a project you could buy a Gaggia Classic Pro for $499 (or a used one cheaper) and go the gaggiuino route.
gaggiuino.github.io#/?id=home
aftermath.site/gaggiuino-ga...

04.02.2025 15:15 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

Yea - they are shockingly close to ones I've seen in the past. Maybe if i get some time in the future and go and find some live ones. One I remember used the word "crunchy" to describe the headphones and i was reminded of a wine review for "chewy tannins". Hilarious

29.01.2025 20:56 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Post image

Well... that's the joke! They are all written to be basically as generic as possible so that they could equally apply to either wine or headphones. I seeded the vote counts out of the gate so that you wouldn't have like 1 or 2 votes swing the graph too much, but here's an example post-vote.

28.01.2025 22:18 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Video thumbnail

So I decided to make a game out of this. Welcome to "Bottles or Cans" where you can read a review and guess if its for a bottle of wine or for a pair of headphones (aka cans).

bottlesorcans.com

28.01.2025 19:04 ๐Ÿ‘ 14 ๐Ÿ” 2 ๐Ÿ’ฌ 2 ๐Ÿ“Œ 1

So a long time ago when buying new headphones and reading reviews, I noticed how the reviews often sounded similar to reviews for a bottle of wine. Like:

"Rich and full-bodied with excellent depth. The bass notes are particularly impressive, with a smooth finish that lingers pleasantly."

28.01.2025 19:04 ๐Ÿ‘ 2 ๐Ÿ” 1 ๐Ÿ’ฌ 2 ๐Ÿ“Œ 0

not that i'm sure you need ANOTHER one to look at, BUT i'm a big fan of the recteq smokers too. I agree to pass on the Traegers, i've seen too many literally go up in flames.

29.12.2024 15:39 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 2 ๐Ÿ“Œ 0
Post image Post image Post image Post image
03.07.2023 00:35 ๐Ÿ‘ 6670 ๐Ÿ” 1755 ๐Ÿ’ฌ 43 ๐Ÿ“Œ 43
How do you pronounce โ€œwwwโ€ the abbreviation for โ€œWorld Wide Webโ€?  #shorts
How do you pronounce โ€œwwwโ€ the abbreviation for โ€œWorld Wide Webโ€? #shorts #www #sysadmin #sysadminlife #devops #sre #pronunciation #tutorial #software #opensource #developers #techtok

How do you pronounce โ€œwwwโ€ the abbreviation for โ€œWorld Wide Webโ€?

https://youtube.com/shorts/MxuX7M661Hg

#www #sysadmin #devops #sre #pronunciation #tutorial #software #developers

03.07.2023 13:55 ๐Ÿ‘ 3 ๐Ÿ” 2 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 1
How do you pronounce "sudo" the #linux/#unix command? #shorts
How do you pronounce "sudo" the #linux/#unix command? #shorts How do you pronouce "sudo" the #linux/#unix command? #sudo #sysadmin #sysadminlife #devops #sre #pronounciation #tutorial #software #opensource #developers #...

How do you pronounce "sudo" the #linux/#unix command?

So are you team "Su DOUGH" or team "Su DOOOO"

https://www.youtube.com/shorts/qpi5wYblQfY

25.06.2023 12:07 ๐Ÿ‘ 3 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
There is no agreed upon way to pronounce \
There is no agreed upon way to pronounce \ #techtips #data #softwareengineer #developer #devops #sysadmin #software #linux #techlife #tutorial #pronounciation #shorts #linux

This is probably the most requested pronunciation video i've gotten.

How do you say: "fsck" - a.k.a - File System Check.

There is no agreed upon pronunciation of this one!

https://youtube.com/shorts/7b-X6MJGkdA

#linux #sysadmin #devops #sre

26.05.2023 17:09 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
Have you heard of JWT before? But how would YOU pronounce it? #shorts #software  #howdoyousay
Have you heard of JWT before? But how would YOU pronounce it? #shorts #software #howdoyousay Have you heard of JWT before? But how would YOU pronounce it? #shorts #software #howdoyousay #tech #softwareengineering #softwaretutorials

Another episode of โ€œHow do you sayโ€. This one is definitely one of my favorites.

How do you say JWT (JSON Web Token)?

https://youtube.com/shorts/D2D9umQMKhA?feature=share

20.05.2023 12:30 ๐Ÿ‘ 2 ๐Ÿ” 2 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
petecheslock on TikTok Did you know there are 3 ways to pronouce #SQL? #techtok #data #softwareengineer #developer #tech #code

Did you know there are at least 3 (THREE) different ways to say SQL?

https://www.tiktok.com/t/ZTRK9rNSh/

13.05.2023 12:14 ๐Ÿ‘ 1 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
petecheslock on TikTok Did you know there are 3 ways to pronouce #SQL? #techtok #data #softwareengineer #developer #tech #code

Did you know there are at least 3 (THREE) different ways to say SQL?

https://www.tiktok.com/t/ZTRK9rNSh/

13.05.2023 12:14 ๐Ÿ‘ 1 ๐Ÿ” 1 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
petecheslock on TikTok Hey #techtok How do you sayโ€ฆ โ€œepochโ€ https://en.m.wikipedia.org/wiki/Epoch_(computing) #p#pronouncet#technologys#softwaredevelopers

How do you sayโ€ฆ.. โ€œEpochโ€

Turns out this one was heavily contested on pronounciation.

https://www.tiktok.com/t/ZTRws7M3b/

05.05.2023 21:23 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0