#Tokenizer

1 month ago

How does AI memory work? It's not at all like your phone or computer. Here's the scoop: 500ways.com/how-does-ai-... ( #AI, #artificialIntelligence, #tokenMemory, #tokenizer, #tokenized, #nonLinearMemory, #AIMemory, #LLM, #largeLanguageModel, #serverFarm, #serverMemory)

0 0 0 0

1 month ago

Visual-Word Tokenizer: Beyond Fixed Sets of Tokens in Vision Transformers

Leonidas Gee, Wing Yan Li, Viktoriia Sharmanska, Novi Quadrianto

Action editor: Blake Richards

https://openreview.net/forum?id=YYOS1FHYG3

#tokenizer #visual #tokens

0 0 0 0

2 months ago

Discrete Audio Tokens: More Than a Survey!

Pooneh Mousavi, Gallil Maimon, Adel Moumen et al.

Action editor: Tatsuya Harada

https://openreview.net/forum?id=eqNchtvc6v

#tokenizers #tokenizer #tokenization

0 0 0 0

3 months ago

How does AI memory work? It's not at all like your phone or computer. Here's the scoop: 500ways.com/how-does-ai-... ( #AI, #artificialIntelligence, #tokenMemory, #tokenizer, #tokenized, #nonLinearMemory, #AIMemory, #LLM, #largeLanguageModel, #serverFarm, #serverMemory)

0 0 0 0

4 months ago

New #J2C Certification:

Discrete Audio Tokens: More Than a Survey!

Pooneh Mousavi, Gallil Maimon, Adel Moumen et al.

https://openreview.net/forum?id=eqNchtvc6v

#tokenizers #tokenizer #tokenization

0 0 0 0

4 months ago

How does AI memory work? It's not at all like your phone or computer. Here's the scoop: 500ways.com/how-does-ai-... ( #AI, #artificialIntelligence, #tokenMemory, #tokenizer, #tokenized, #nonLinearMemory, #AIMemory, #LLM, #largeLanguageModel, #serverFarm, #serverMemory)

2 0 0 0

GetNews.me

@getnews-me.bsky.social

5 months ago

Aligning Foundation Encoders as Tokenizers for Diffusion Models

A three‑stage tokenizer let a diffusion model reach gFID 1.90 on ImageNet (256 × 256) after 64 epochs and beat the VAE baseline in a 2‑billion‑parameter text‑to‑image model. Read more: getnews.me/aligning-foundation-enco... #diffusion #tokenizer

0 0 0 0

5 months ago

How does AI memory work? It's not at all like your phone or computer. Here's the scoop: 500ways.com/how-does-ai-... ( #AI, #artificialIntelligence, #tokenMemory, #tokenizer, #tokenized, #nonLinearMemory, #AIMemory, #LLM, #largeLanguageModel, #serverFarm, #serverMemory)

1 0 0 0

7 months ago

How does AI memory work? It's not at all like your phone or computer. Here's the scoop: 500ways.com/how-does-ai-... ( #AI, #artificialIntelligence, #tokenMemory, #tokenizer, #tokenized, #nonLinearMemory, #AIMemory, #LLM, #largeLanguageModel, #serverFarm, #serverMemory)

0 1 0 0

𝐩 𝟑 𝐧 𝐆 𝐮 𝟏 𝐧 𝐙 𝐳

@p3ngu1nzz.bsky.social

8 months ago

first batch is underway of ~1500 pages. hopefully be able to get a few thousand good finds out of these. #ai #embedding #tokenizer

@samanthahoriz0n.bsky.social

2 0 1 0

8 months ago

How does AI memory work? It's not at all like your phone or computer. Here's the scoop: 500ways.com/how-does-ai-... ( #AI, #artificialIntelligence, #tokenMemory, #tokenizer, #tokenized, #nonLinearMemory, #AIMemory, #LLM, #largeLanguageModel, #serverFarm, #serverMemory)

1 0 0 0

@senoritadeveloper.bsky.social

9 months ago

How does AI memory work? It's not at all like your phone or computer. Here's the scoop: 500ways.com/how-does-ai-... (#AI, #artificialIntelligence, #tokenMemory, #tokenizer, #tokenized, #nonLinearMemory, #AIMemory, #LLM, #largeLanguageModel, #serverFarm, #serverMemory)

1 0 0 0

11 months ago

ElasticSearch — Analyzers, Tokens, Filters What are ElasticSearch’s Analyzers, Tokens, Filters and How to Implement Custom Ones

ElasticSearch — Analyzers, Tokens, Filters - What are Elasticsearch’s Analyzers, Tokens, Filters and How to Implement Custom Ones #elasticsearch #analyzer #tokenizer #filter #indexing medium.com/turkcell/ela...

0 0 0 0

Abed Khooli

@akhooli.bsky.social

1 year ago

Abed Khooli on LinkedIn: ALLaM Language Model and Revisiting Arabic Tokenizers A few days ago… ALLaM Language Model and Revisiting Arabic Tokenizers A few days ago, NCAI/SDAIA published a 7B instruction version of ALLaM (https://lnkd.in/diNUwrt2). The…

ALLaM Language Model and Revisiting Arabic Tokenizers
www.linkedin.com/posts/akhool...
#ALLaM #tokenizer #NLP #AI #LLMs

1 0 0 0

Tech Pakistan

@blockchainpakistan.bsky.social

1 year ago

"bert-base_cased" #tokenizer vs. "Xenova/gpt-4" #tokenizer for a given text.
"bert-base_cased" vocab length: 28996
Xenova/gpt-4" vocab length: 100263

0 0 0 0

Tech Pakistan

@blockchainpakistan.bsky.social

1 year ago

An Example of the vocabulary length of the "bert-base-cased" #tokenizer and a colored list of the tokens generated for the given text.
[UNK] -> Unknown word
[##] -> token for a word

0 0 0 0