#INT8

@doyouknnow.bsky.social

3 months ago

LLM 양자화 완벽 가이드! INT4로 메모리 87.5% 절감, FP8로 처리량 43% 향상. GPTQ vs AWQ vs GGUF 비교, Llama 3 양자화 성능 벤치마크, Q4까지 손실 2% 미만! Pruning + Knowledge Distillation 경량화 기법, 하드웨어별 추천 전략, QLoRA Fine-tuning까지!

#AWQ #FP8 #GGUF #GPTQ #INT4 #INT8 #KnowledgeDistillation #Llama3 #llamacpp
doyouknow.kr/618/llm-quan...

0 0 0 0

Petar Toushkov

@cellfourteen.petertoushkov.eu

5 months ago

#gaming #Cyberpunk2077 #FSR4 #INT8 #PathTracing

1 0 0 0

Petar Toushkov

@cellfourteen.social.petertoushkov.eu.ap.brid.gy

5 months ago

YoutTube thumbnail: "FSR 4 INT8, RX 7800 XT, But Can It Path Trace?"

Let's give this RX 7800 XT a run for its money. Can it path trace? ->

Cyberpunk 2077 - FSR 4 - INT8 - Path Tracing - RX 7800 XT

https://www.youtube.com/watch?v=1eJJ2VmCUUY

1 0 0 1

Petar Toushkov

@cellfourteen.petertoushkov.eu

5 months ago

Final Fantasy XVI - FSR 4 - INT8 DLL - RX 7800 XT YouTube video by cellfourteen

Final Fantasy XVI - #FSR4 - #INT8 DLL - RX 7800 XT

www.youtube.com/watch?v=91hV...

#gaming

1 0 0 0

HGPU group

@hgpu.bsky.social

5 months ago

Performance and Numerical Aspects of Decompositional Factorizations with FP64 Floating-Point Emulation in INT8 Mixing precisions for performance has been an ongoing trend as the modern hardware accelerators started including new, and mostly lower-precision, data formats. The advantage of using them is a gre…

Performance and Numerical Aspects of Decompositional Factorizations with FP64 Floating-Point Emulation in INT8

#NVIDIA #Int8 #FP64 #Factorization

hgpu.org?p=30278

0 0 0 0

LLMs

@llms.activitypub.awakari.com.ap.brid.gy

10 months ago

Original post on franksworld.com

How to Optimize for performance with vLLM vLLM, a versatile and efficient LM inference engine. Th...

www.franksworld.com/2025/05/09/how-to-optimi...

#AI #Red #Hat #AI/ML […]

0 0 0 0

CloudThrill

@cloudthrill.bsky.social

1 year ago

#💡NewBlogAlert! What is hashtag#Quantization 🙋🏻‍♀️⁉️
llama.cpp, GGML GGUF, Bets K Quants,The Microsoft BitNet Quants .. Everything you need to know about quantization in one article 🔥
#LLMs #LLAMA #Quantization #Kquants #INT8 #AI #AIevals

0 0 0 0

Posts tagged #INT8