Matt Kmiecik (@mattkmiecik)

My favorite figures (to date) In my previous post, I discussed my pipeline for making publication-ready figures.

Maximizing information while minimizing clutter is key to a great figure. In this post, I cover my approach for various figures throughout the years.

Read more: mattkmiecik.substack.com/p/my-favorit...

#rstats #dataviz #science #publishing

16.06.2025 15:10 👍 3 🔁 0 💬 0 📌 0

Certainly! I wouldn't recommend going to GUI if the data will change.

14.06.2025 19:32 👍 1 🔁 0 💬 0 📌 0

Yes! Don't get me wrong, everything is possible in ggplot2 and it's extensions. But I've found it nice to polish off figures in a GUI when at the final stages or experimenting with different aesthetics/placement, etc.

14.06.2025 16:32 👍 1 🔁 0 💬 2 📌 0

Using Omnigraffle for publication ready figures In graduate school, I took an advanced statistics course that changed my life.

ggplot2 is powerful, but not always pretty. 😬

That’s why I use Omnigraffle for final figure touches—cleaner legends, custom labels, and full control over design.

Full workflow here: mattkmiecik.substack.com/p/using-omni...

#rstats #dataviz #omnigraffle #ggplot2

14.06.2025 15:10 👍 4 🔁 0 💬 2 📌 1

Scaling variables in R A walkthrough of a tip I recently learned

Just dropped a new #rstats Substack post!

It’s about a better way to scale variables in R that won’t break predict() (and keeps your model inputs traceable).

mattkmiecik.substack.com/p/scaling-va...

#Tidyverse #DataAnalysis

12.06.2025 15:10 👍 6 🔁 1 💬 0 📌 0

Great, thanks!

11.06.2025 03:10 👍 2 🔁 0 💬 0 📌 0

Is there a formal way to give yourself permission or is this implied by CC-BY?

11.06.2025 02:41 👍 1 🔁 0 💬 1 📌 0

Scaling variables in R A walkthrough of a tip I recently learned

Z-scoring in R? There’s a neat way to retain the original mean and SD after using scale(). This makes back-transforming for plots super easy.

Full example: mattkmiecik.substack.com/p/scaling-va...

#rstats #Reproducibility #CodingTips

10.06.2025 15:10 👍 5 🔁 1 💬 0 📌 0

I started this Substack just so that I could use Ulysses Alright, that might be an exaggeration.

Scientific writing has trapped me in Word & Google Docs. For personal writing, I needed something better. Ulysses is that something. Clean, simple, and dare I say... fun?

Here’s why I love it:
✍️ mattkmiecik.substack.com/p/i-started-...

#WritingCommunity #TechTools #MacApps #ProductivityHacks

08.06.2025 15:10 👍 2 🔁 1 💬 0 📌 0

🚨 Big milestone for Rdatasets 🚨

The web archive now hosts 3400+ free and documented CSV datasets. Fantastic for teaching and testing!

And {Rdatasets} is a new #RStats 📦 for easy download and search

Web archive: vincentarelbundock.github.io/Rdatasets

R 📦: vincentarelbundock.github.io/Rdatasetspkg

06.06.2025 17:39 👍 257 🔁 78 💬 3 📌 1

Scaling variables in R A walkthrough of a tip I recently learned

I used to wrap scale() in as.numeric() and call it a day. Now I know better: drop() preserves attributes and behaves cleanly.

Full walkthrough (with mtcars, lm, ggplot2): mattkmiecik.substack.com/p/scaling-va...

#rstats #DataViz #OpenScience

06.06.2025 15:14 👍 4 🔁 1 💬 0 📌 0

Personally I prefer using datawizard::standardize(), and specifically using it *in the formula*.

So
mtcars$hp_z <- scale(mtcars$hp)
mpg ~ hp_z

Becomes
mpg ~ standardize(hp)

This solves both issues you raise in your post.

#rstats @easystats.bsky.social

04.06.2025 19:33 👍 10 🔁 2 💬 1 📌 1

Nice! Thanks for commenting about datawizard::standardize(). It looks like a great solution 😀

04.06.2025 20:07 👍 0 🔁 0 💬 0 📌 0

Scaling variables in R A walkthrough of a tip I recently learned

Scaling variables in R? scale() is powerful — but its default behavior can mess with predict().

Here’s a clean fix using drop() + how to retain scaling attributes for later.

mattkmiecik.substack.com/p/scaling-va...

#rstats #DataScience #Statistics

04.06.2025 15:10 👍 6 🔁 1 💬 1 📌 0

GitHub - tidyverse/ragnar Contribute to tidyverse/ragnar development by creating an account on GitHub.

github.com/tidyverse/ra...

03.06.2025 05:29 👍 1 🔁 0 💬 0 📌 0

This has to be the best #rstats hex sticker I've ever seen

03.06.2025 05:27 👍 6 🔁 0 💬 1 📌 0

My start with R The programming language that made me a better scientist

Here's one in the other direction! mattkmiecik.substack.com/p/my-start-w...

02.06.2025 01:06 👍 0 🔁 0 💬 0 📌 0

nice

29.05.2025 23:48 👍 1 🔁 0 💬 0 📌 0

Too funny! 🤣

27.05.2025 19:56 👍 1 🔁 0 💬 0 📌 0

Labeling facets in ggplot2 Who knew such a small setting could make such a big improvement

One of my favorite tricks I’ve learned from ChatGPT: better facet labels in ggplot2.

Check out the latest post: mattkmiecik.substack.com/p/labeling-f...

#rstats #ggplot2 #dataviz

27.05.2025 18:23 👍 3 🔁 0 💬 1 📌 0

22.05.2025 18:55 👍 82 🔁 12 💬 1 📌 0

Labeling facets in ggplot2 Who knew such a small setting could make such a big improvement

Adding labeller = label_both to your facet calls in ggplot2 can massively improve figure readability! I wrote a quick blogpost with a use case and some examples.

mattkmiecik.substack.com/p/labeling-f...

#rstats #ggplot2 #VisualizationTips #dataviz

23.05.2025 15:45 👍 2 🔁 1 💬 0 📌 0

Most applications in R run pretty quickly. But these days, many R users are faced with huge datasets and/or long-running applications, making parallel computation attractive. 🧵 1/

21.05.2025 02:37 👍 24 🔁 8 💬 1 📌 0

Labeling facets in ggplot2 Who knew such a small setting could make such a big improvement

Struggling with confusing facet labels in ggplot2? I just shared a quick fix that makes your plots clearer using labeller = label_both.

mattkmiecik.substack.com/p/labeling-f...

#rstats #dataviz #ggplot2 #DataScience

21.05.2025 15:10 👍 3 🔁 0 💬 0 📌 0

Named vectors in R A powerful tool for plotting colors

R is full of surprises—even after a decade, I’m still learning simple but powerful tricks. Named vectors are my latest discovery, thanks to ChatGPT!

mattkmiecik.substack.com/p/named-vectors-in-r

#RStats #Learning #ChatGPT #Dataviz

17.05.2025 15:10 👍 2 🔁 0 💬 0 📌 0

Thank you so much Ben! I'm looking forward to giving these a read

16.05.2025 22:49 👍 0 🔁 0 💬 0 📌 0

Will do! Thanks again

16.05.2025 04:49 👍 2 🔁 0 💬 0 📌 0

Thanks Norm! I recently discovered the Tweedie distribution; this has really improved things for some of the data I am working with

16.05.2025 04:41 👍 0 🔁 0 💬 1 📌 0

Great, thank you! Would you recommend any texts about what sort of distributions are available and how to interpret the resultant coefficients?

15.05.2025 22:08 👍 0 🔁 0 💬 1 📌 0

I'm looking for literature recommendations on guidance for specifying distributional assumptions in generalized linear/additive models. I've mainly stuck with gaussian and binomial distributions, but am looking to explore other distributions to enhance specification. Thank you! #rstats

15.05.2025 17:50 👍 1 🔁 0 💬 2 📌 0

Matt Kmiecik

Latest posts by Matt Kmiecik @mattkmiecik