Trending

#TokenizerFree

Latest posts tagged with #TokenizerFree on Bluesky

Latest Top
Trending

Posts tagged #TokenizerFree

Post image

Back at itโ€”system gave us 500 gemsโ€ฆ and 10ร— more junk ๐Ÿ˜‚. Quick tweaks and weโ€™re nearly done with stage one: mining pretrain data from rare, cross-domain PDFs.

#AIpretrain #SpanAware #TokenizerFree #PDFMining #XSpanformer #DataCuration #OpenScience
#artificalintelligence

0 0 1 0

๐Ÿง  X-Spanformer ditched "improver"โ€”now guided by 5-judge consensus ๐Ÿ—ณ๏ธ to approve text for ox-bar span compilation. Cleaner segments. Swarm decides.

#ai #artificialintelligence #transformers #ltsm #computerscience #XSpanformer #TokenizerFree #SpanAware #SemanticEmbeddings #OxBarTheory #TauSystem ๐Ÿ„

1 1 0 0
Post image

๐Ÿšง Building out the pretrain pipeline for X-Spanformer: github.com/p3nGu1nZz/x-... /// PDF segmentation + judge/improver enrichment for Tau2.0 tokenizer. Zero tokens. All spans. #AI #TokenizerFree #TauSystems #NLP #TransformerArchitecture #OpenSource #FungalLogic #SpanAware #XBarTheory

4 1 0 0
Preview
GitHub - p3nGu1nZz/x-spanformer: Tokenizer-free, span-aware encoder architecture inspired by X-bar theory. Jointly learns segmentation and representation using pointer networks and compositional spans... Tokenizer-free, span-aware encoder architecture inspired by X-bar theory. Jointly learns segmentation and representation using pointer networks and compositional spans. - p3nGu1nZz/x-spanformer

๐Ÿง  Back from break + back on code. Diving into X-Spanformer, a tokenizer-free, span-aware encoder built with X-bar theory magic.

๐Ÿ”— github.com/p3nGu1nZz/x-...

#AI #software #BiomimeticComputing #TokenizerFree #StructuredLearning #NeuromorphicDesign #XBarTheory #OpenSource #SemanticEmbedding

5 1 0 0
Post image Post image

Up next on stage, Dr. @edoardo-ponti.bsky.social ( @edinburgh-uni.bsky.social / NVIDIA)
๐ŸŽค โ€œAdaptive Units of Computation: Towards Sublinear-Memory and Tokenizer-Free Foundation Modelsโ€

Fascinating glimpse into the next gen of foundation models.

#FoundationModels #NLP #TokenizerFree #ADSAI2025

2 1 1 0