Gordon's Avatar

Gordon

@gordonmurray.com

AWS Systems Engineer | Cloud Infrastructure and Terraform Enthusiast | Focused on scalable, secure and compliant solutions | Always exploring new tools GitHub https://github.com/gordonmurray

66
Followers
154
Following
49
Posts
25.08.2023
Joined
Posts Following

Latest posts by Gordon @gordonmurray.com

Deploy app servers close to your users · Fly

I really like using fly.io for small projects. I like knowing cost up front though, so I wrote a CLI tool called flycost.
Like www.infracost.io but for fly.io. It reads your fly.toml file and gives a monthly cost estimate
I’d love feedback from anyone running on Fly.io
github.com/gordonmurray...

16.09.2025 20:01 👍 0 🔁 0 💬 0 📌 0
notebare.com

I built a simple to-do list app called www.notebare.com. All the data is stored in Lance format from @lancedb. Performance seems quick, Anyone can try it out, you can sign in with Github. Feedback more than welcome!

15.09.2025 19:21 👍 0 🔁 0 💬 0 📌 0
Preview
GitHub - gordonmurray/apache_flink_and_docker_compose: A complete real-time Change Data Capture (CDC) pipeline using Apache Flink, MariaDB, and Docker Compose. This project demonstrates how to build a... A complete real-time Change Data Capture (CDC) pipeline using Apache Flink, MariaDB, and Docker Compose. This project demonstrates how to build a modern streaming analytics system that processes da...

Cleaned up my old project: CDC from MariaDB → Flink → back into MariaDB using Docker Compose. Flink’s CDC connectors (powered by Debezium) keeps the target table up to date in real time. Code on GitHub: github.com/gordonmurray...

07.09.2025 17:17 👍 2 🔁 0 💬 0 📌 0
Preview
GitHub - gordonmurray/apache_fluss_flink_and_paimon: This project integrates Apache Fluss (stream-batch unified storage) with Apache Paimon (data lake storage) using Apache Flink This project integrates Apache Fluss (stream-batch unified storage) with Apache Paimon (data lake storage) using Apache Flink - gordonmurray/apache_fluss_flink_and_paimon

Apache Fluss is a storage engine:
• Fast key lookups (e.g. user status)
• Instant historic queries (via Paimon on S3/ORC)
• Works with Flink SQL for heavy lifting
• Tablets replicate & recover automatically

Made a small POC to try it out: github.com/gordonmurray...

05.09.2025 17:29 👍 1 🔁 0 💬 0 📌 0
Preview
GitHub - gordonmurray/apache_flink_and_paimon: Trying out Apache Paimon with Apache Flink using Docker Compose Trying out Apache Paimon with Apache Flink using Docker Compose - gordonmurray/apache_flink_and_paimon

I’ve refreshed a small hobby project from ~2 years ago: Apache Flink & Paimon.
It picked up a few early stars back then. I’ve cleaned it up so it’s runnable again.
I’m hoping to experiment with Apache Fluss on top of this. Code on Github github.com/gordonmurray...

04.09.2025 19:02 👍 1 🔁 0 💬 0 📌 0
Preview
GitHub - gordonmurray/fastapi-temporal-logic-of-actions: A small introduction to verification using TLA+ through a file upload idempotency problem A small introduction to verification using TLA+ through a file upload idempotency problem - gordonmurray/fastapi-temporal-logic-of-actions

I took some time this weekend to learn TLA+. It lets you test distributed systems by exploring all possible states. Meaning you can catch bugs before production and likely save costs. I built a small FastAPI + MinIO demo to try it out. Repo: github.com/gordonmurray...

17.08.2025 17:56 👍 0 🔁 0 💬 0 📌 0
When Lance Hits the Wall at 70 Images on Cloudflare R2 | Gordon Murray First impressions of using Lance with Python were excellent. It took almost no code at all to upload images, vectorize them, save the data in Lance format in...

I tried to build a minimal image vector store using Lance and Cloudflare R2. I Learned a lot, hit a hard limit fast.

"When Lance hits a wall at 70 images"

gordonmurray.com/data/2025/05...

14.05.2025 12:56 👍 1 🔁 0 💬 0 📌 0
Preview
IaCConf - The First Community-Driven IaC Conference | May 15, 2025 IaCConf is the first community-driven virtual IaC conference. It will feature discussions on Infrastructure as Code trends and sessions with industry leaders.

Oooh www.iacconf.com

10.05.2025 08:55 👍 0 🔁 0 💬 0 📌 0
QuickVectors - Image Vectorization

Trying out Lance for storing vector data on R2. I’ve got Python saving the image and its CLIP embedding to a Lance dataset on R2. It’s append-only, columnar format kind of like Parquet, but optimized for updates, metadata, and search. Compared to Iceberg, feels like a better fit for vector mappings

09.05.2025 19:54 👍 1 🔁 0 💬 0 📌 0
Post image

I made a thing: upload an image, get back a vector embedding. That’s it.

Built with Python, FastAPI, and OpenAI’s CLIP model (ViT-B/32). Deployed on Fly.io.

quickvectors.com

Want to try it out? DM me for an API key.

08.05.2025 22:46 👍 0 🔁 0 💬 0 📌 0
Building A Digital Asset Management Project With Different Tools | Gordon Murray Building a Digital Asset Management (DAM) project with different tools

Built a small DAM-style pipeline to handle image uploads, async processing, and ML enrichment. No DB, just new tools I’ve been meaning to try (fly.io, R2, Parquet). Wrote a post about what I’ve built so far and what’s next gordonmurray.com/2025/05/05/B...

05.05.2025 20:57 👍 0 🔁 0 💬 0 📌 0
Preview
GitHub - gordonmurray/dam-pipeline-bento: A decoupled processing pipeline for a DAM using FastAPI, RabbitMQ, Cloudflare R2, and BentoML + CLIP for image vectorization A decoupled processing pipeline for a DAM using FastAPI, RabbitMQ, Cloudflare R2, and BentoML + CLIP for image vectorization - gordonmurray/dam-pipeline-bento

Built a Python pipeline to upload images to R2 and generate captions + vectors using BLIP and CLIP from Hugging Face. Metadata stored as Parquet. Code on GitHub: github.com/gordonmurray...

02.05.2025 22:42 👍 0 🔁 0 💬 0 📌 0

Trying out Fly.io and Cloudflare R2 - long overdue on my part. Made a small Python app to receive an image, store it in R2 with Parquet metadata, and query via DuckDB+SQL.

Very basic, but next I want to try R2's data catalog / managed Apache Iceberg.

Code on github github.com/gordonmurray...

30.04.2025 21:23 👍 1 🔁 0 💬 0 📌 0
Preview
GitHub - gordonmurray/terraform_aws_deepseek: A terraform project to create an EC2 instance running Deepseek R1 A terraform project to create an EC2 instance running Deepseek R1 - gordonmurray/terraform_aws_deepseek

I created a small Terraform project that will set up Deepseek R1 for you on your own GPU EC2 instance, seems to be uncensored too. Code on Github at github.com/gordonmurray...

I took the steps from this great AWS Community guide community.aws/content/2sEu...

29.01.2025 22:36 👍 0 🔁 0 💬 0 📌 0
GitHub - gordonmurray/bentoml-image-vectorization: This project offers an API for vectorizing images using OpenAI's CLIP model via BentoML and integrates Milvus for efficient vector storage and search This project offers an API for vectorizing images using OpenAI's CLIP model via BentoML and integrates Milvus for efficient vector storage and search - gordonmurray/bentoml-image-vectorization

Updated my BentoML project to include Milvus for storing and searching vectorized images. More to learn about L2 Euclidean distance for better search results
github.com/gordonmurray...

16.12.2024 22:54 👍 1 🔁 0 💬 0 📌 0
A line diagram of making a Curl request to BentoML on ECS, protected by WAF

A line diagram of making a Curl request to BentoML on ECS, protected by WAF

Just finished a small project on deploying BentoML on ECS using Terraform. Perfect for creating vectors from images with a CLIP model I’ve been learning. If you’re getting in to machine learning I'd recommend BentoML, hopefully this is a useful starting point!
github.com/gordonmurray...

14.12.2024 21:36 👍 3 🔁 0 💬 0 📌 0
Post image

Its surprising how many requests and countries try an API endpoint that has only just come online in the last few minutes, using a brand new domain name just registered too.

13.12.2024 19:21 👍 1 🔁 0 💬 0 📌 0
Preview
GitHub - gordonmurray/bentoml: This project provides an API for vectorizing images using OpenAI's CLIP model, packaged and served with BentoML. It allows you to send images to the API and receive a ve... This project provides an API for vectorizing images using OpenAI's CLIP model, packaged and served with BentoML. It allows you to send images to the API and receive a vector representation that...

Tried out BentoML this evening to vectorize images, and it works really well! I expected to need a GPU, but it runs smoothly on a CPU. It even exposes Prometheus metrics out of the box. Code on GitHub: github.com/gordonmurray...

10.12.2024 23:01 👍 0 🔁 0 💬 0 📌 0

Ah this is great, I didn’t know this was available. The DDB approach was fine but this would be nicer

10.12.2024 08:29 👍 0 🔁 0 💬 0 📌 0
Preview
The AWS News Feed Keep track of AWS news and blogs. See what's hot, read summaries, search, set bookmarks and ratings, and categorize articles with tags.

Re:Invent hasn't even officially started yet and we've already got 22 announcements. No rest on Re:Invent Sunday, I guess. To help you keep track, I will curate a number of custom feeds. The first one: All Re:Invent 2024 Releases

Check it out below 👇

01.12.2024 22:52 👍 5 🔁 2 💬 0 📌 0

The “show vehicle motion cues” option on iOS really works! For me at least. I can read away as a passenger and not feel queasy. Really cool 📖

30.11.2024 16:34 👍 1 🔁 0 💬 0 📌 0
Preview
Writing is the Thinking Remember: [[Writing]] is the thinking. The more you write, the more notes you take, and the better of a thinker you become. You’ll read better with better questions in your mind. Without writing, ther...

If you’re thinking without writing, you only think you’re thinking.—Leslie Lamport

30.11.2024 15:44 👍 26 🔁 5 💬 0 📌 0
A screenshot of some JSON in a readme, using json5 so that comments can be added to the json snippet

A screenshot of some JSON in a readme, using json5 so that comments can be added to the json snippet

The system works!

github.com/gordonmurray...

29.11.2024 22:02 👍 1 🔁 0 💬 0 📌 0

Oh nice. I didn’t know that. Thanks! I have some repos to go and update

29.11.2024 20:14 👍 8 🔁 3 💬 0 📌 0

Today is as good day to start learning Terraform

28.11.2024 09:22 👍 1 🔁 0 💬 0 📌 0
Post image

Hmm 🤔 Maybe I just created the AWS News Firehose on Bluesky. Maybe it's hosted at https://buff.ly/4g3kQsK 🦋 And maybe it's related to Bluesky's recent posts-as-comments demo... 🤷‍♀️ Who knows!

26.11.2024 22:47 👍 14 🔁 3 💬 3 📌 0
Preview
Ultimate Directory of Apache Iceberg Resources This article is a comprehensive directory of Apache Iceberg resources, including educational materials, tutorials, and hands-on exercises. Whether you’re a beginner or an experienced data engineer…

Very cool resource for learning about Apache Iceberg. I’d like something similar for Apache Hudi too

“Ultimate Directory of Apache Iceberg Resources” medium.com/data-enginee...

24.11.2024 23:09 👍 1 🔁 0 💬 0 📌 0

👍Ideally I’d set it up using Terraform though I assume it isn’t supported there yet.
I’ll set it up manually and see how it goes and write a post on it

19.11.2024 19:20 👍 0 🔁 0 💬 0 📌 0

I’ll give it a go too. I’m curious to compare it to using debezium and flink to do the same work

16.11.2024 20:05 👍 2 🔁 0 💬 1 📌 0
Preview
Replicate changes from databases to Apache Iceberg tables using Amazon Data Firehose (in preview) | Amazon Web Services Amazon Data Firehose introduces a new capability that captures database changes and streams updates to a data lake or warehouse, supporting PostgreSQL, MySQL, Oracle, SQL Server, and MongoDB, with aut...

Ooh that’s kinda cool. CDC from MySQL directly to Iceberg on S3 using AWS Firehose.

I’m a big fan of debezium though this could be cool to try out. It’s might lend itself nicely to be managed in Terraform too.

aws.amazon.com/blogs/aws/re...

16.11.2024 17:02 👍 2 🔁 0 💬 1 📌 0