Theo's Avatar

Theo

@theukka

Trying to couple the R and CUDA together with https://github.com/bbtheo/cuplyr

41
Followers
267
Following
14
Posts
20.11.2024
Joined
Posts Following

Latest posts by Theo @theukka

tidychain

new package alert! {tidychain} is a #rstats packaged inspired by the below authors experience in showing how an excel file was changed / manipulated by looking at the underlying xml files to prove fraud in research

datacolada.org/109package

usrbinr.codeberg.page/tidychain/

01.03.2026 04:35 πŸ‘ 69 πŸ” 21 πŸ’¬ 2 πŸ“Œ 1

bracketeer is now in CRAN!!

27.02.2026 07:05 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

#rstats Can the people complaining about R pipes not being "functional" enough please go hang out in the SQL forums? If any language needs help with ease of debugging and interpreting it's SQL.

Thank you and please.

26.02.2026 16:05 πŸ‘ 16 πŸ” 6 πŸ’¬ 1 πŸ“Œ 1

I'm not sure where the threshold is for roundtripping the data for a single operation. But there is a substantial speedup even on smaller datasets when the data is kept on gpu and repeated operations are run, like hypothesis testing.

25.02.2026 09:17 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Agree completely. This is why I implemented glimpse style printing as the default for tbl_gpus in cuplyr

25.02.2026 06:17 πŸ‘ 4 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Thanks man! Really appreciate it

23.02.2026 11:29 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Trying to stay neutral in the #rstats flame wars but my side projects are:

* model tournament structures using |>
* running tibbles on gpus

22.02.2026 19:17 πŸ‘ 10 πŸ” 2 πŸ’¬ 1 πŸ“Œ 0
Preview
Google Colab

Packages pipe-first API where you chain stages and control who advances. Build complex tournament dynamics with zero dependencies beyond base R β€” lightweight enough for Shiny internals.

Try the World Cup 2026 simulation:

colab.research.google.com/github/bbthe...

20.02.2026 10:55 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
Tournament Generator Create and manage tournament brackets for various competition formats including single elimination, double elimination, round robin, Swiss system, and group-stage-to-knockout tournaments. Provides too...

Started building {bracketeer} because I needed to run a pΓ©tanque tournament for friends and couldn't believe this wasn't solved already.

Turns out tournament modeling is harder than it first looks.

bbtheo.github.io/bracketeer/
#rstats

20.02.2026 10:51 πŸ‘ 4 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
Google Colab

Want to try it without installing anything? This notebook simulates a full 48-team FIFA World Cup 2026 in your browser. 12 groups, 104 matches, custom qualification rules, all the way to a champion.

colab.research.google.com/github/bbtheo/bracketeer/blob/main/notebooks/world_cup_2026.ipynb

19.02.2026 12:04 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
GitHub - bbtheo/bracketeer Contribute to bbtheo/bracketeer development by creating an account on GitHub.

Pub quiz. Office ping pong. 10,000 simulated World Cups. A Champions League bracket for your fantasy league.

tournament(teams) |>
swiss("open", rounds = 5) |>
single_elim("playoffs", take = top_n(8))

bracketeer 0.1.0 tournament modeling in base #Rstats
github.com/bbtheo/bracketeer

19.02.2026 12:00 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Not really as it is completely reliant on cuda and the cudf library which are sadly locked to Nvidia gpus.

11.02.2026 16:27 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

DuckDB is an amazing tool that I also daily drive! I just wanted to see if I could put my GPU to work and it's been a fun experiment. Works better than I expected.

11.02.2026 11:39 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
GitHub - bbtheo/cuplyr: GPU powered dataframes in R GPU powered dataframes in R. Contribute to bbtheo/cuplyr development by creating an account on GitHub.

cuplyr version 0.1.0 is now out!

A GPU-accelerated dplyr backend for R, powered by RAPIDS cuDF.

Write familiar tidyverse code, execute on GPU. Lazy eval with AST optimization.

In my benchmarks 60x faster than dplyr on 50M rows.

github.com/bbtheo/cuplyr
#rstats #cuda #DataScience

10.02.2026 21:47 πŸ‘ 44 πŸ” 11 πŸ’¬ 1 πŸ“Œ 1

tstats-project.org

08.02.2026 11:50 πŸ‘ 11 πŸ” 3 πŸ’¬ 0 πŸ“Œ 1
shikokuchuo{net}: secretbase: The 40KB Hash Package Streaming cryptographic hashes for R

New post: the design behind secretbase, a 40KB streaming hash package I maintain.

The entire source package is smaller than most hex sticker PNGs.

Hash large objects with constant memory. Powers targets pipelines. Plus CBOR encoding comes to R.

shikokuchuo.net/posts/28-int...

#RStats #OpenSource

04.02.2026 16:02 πŸ‘ 11 πŸ” 3 πŸ’¬ 0 πŸ“Œ 0
Post image

The first data science book that has a chapter on monads reproducible-data-science.dev

Learn how to build robust #DataScience pipelines with #RStats, #Python , #Julia and #Nix !

01.02.2026 11:47 πŸ‘ 26 πŸ” 9 πŸ’¬ 0 πŸ“Œ 2

As alway it is quite iteresting when working with C++ dependecies. But I use Pixi as a package manager which has made it relatively painless.

30.01.2026 08:56 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
GitHub - bbtheo/cuplyr: GPU powered dataframes in R GPU powered dataframes in R. Contribute to bbtheo/cuplyr development by creating an account on GitHub.

What if your dplyr pipelines ran on GPU?

That's what I built with cuplyr! A CUDA-powered backend for #rstats data manipulation. Looking for testers and feedback!

github.com/bbtheo/cuplyr

29.01.2026 17:56 πŸ‘ 32 πŸ” 9 πŸ’¬ 1 πŸ“Œ 0