Trending

#dataprepkit

Latest posts tagged with #dataprepkit on Bluesky

Latest Top
Trending

Posts tagged #dataprepkit

Post image

I am running a workshop at QConSF on Nov 20, in SF.

"Open Source Rag Pipeline With Docling + Data Prep Kit + Milvus + Open LLMs"

You will walk away with working code you can build on.

qconsf.com/training/no...

#QConSF @qconferences.com #Milvus #RAG #DataPrepKit #Docling #NebiusAIStudio

0 1 1 0
Data Prep Kit - pdf processing 1
Data Prep Kit - pdf processing 1 YouTube video by Sujee Maniyam

- PDF processing with DPK
Code walkthrough on how to process PDF documents (parse, dedupe, filter out spam)

πŸŽ₯video: youtu.be/u4OgkmG94fs?...

πŸ’» Data prep kit examples : github.com/data-prep-ki...

#dataprepkit

0 0 0 0
Data Prep Kit Intro 1
Data Prep Kit Intro 1 YouTube video by Sujee Maniyam

Data Prep Kit videos:

- Data Prep Kit Intro
Introduction and feature walk through (document parsing, exact and fuzzy de-duping, chunking, vectorizing, PII removal, document quality)

πŸŽ₯ video : www.youtube.com/watch?v=wCbM...

πŸ’» Data prep kit examples : github.com/data-prep-ki...

#dataprepkit

0 0 1 0
Preview
Mastering Data Cleaning for Fine-Tuning LLMs and RAG Architectures | AI Alliance In the rapidly advancing field of artificial intelligence, data cleaning has become a mission-critical step in ensuring the success of Large Language Models (LLMs) and Retrieval-Augmented Generation (...

Good read: "Mastering Data Cleaning for Fine-Tuning LLMs and RAG Architectures"
thealliance.ai/blog/masteri...

@aialliance.bsky.social @davenielsen.bsky.social

#dataprepkit #RAG #dataprep #finetuning

3 0 0 0
Data Prep Kit Intro 1
Data Prep Kit Intro 1 YouTube video by Sujee Maniyam

Check out Data Prep Kit (DPK) β€” an open-source tool to simplify your data wrangling tasks.

πŸ“Ί Intro video: www.youtube.com/watch?v=wCbM...

πŸ”— GitHub: github.com/data-prep-ki...

#dataprepkit

1 0 0 0
Post image

my upcoming talk: Create High-Quality Datasets by Filtering Out Spam, HAP (Hate, Abuse, Profanity) Speech, and Sensitive Data

πŸ—“οΈ: Thursday Mar 27, 2025
⏰: 9am PST / 12pm EST

Register: lnkd.in/gfRQyzKZ

#dataprepkit #LLM #AIAlliance

0 0 0 0