#ModelComparison

3 months ago

Users find model preference subjective. GPT-5.2 shines for coding tasks, while Gemini is favored for its search integration & intuition. Choosing the right AI depends heavily on your specific individual use case. #ModelComparison 2/6

0 0 1 0

FreeWithAI.com

@freewithai.bsky.social

6 months ago

LMArena AI - Evaluate AI Models LMArena.ai is a comprehensive platform for evaluating AI models through a variety of innovative features. Its core offering, AI Model Battles, enables users

LMArena AI – Evaluate AI Models

#AIBattles #AIModels #ModelComparison #EloLeaderboard #InnovativeTech #AICommunity #ModelEvaluation #AIResearch #TechInnovation #LMArenaAI #FreeWithAI

freewithai.com/lmarena-ai/

1 0 0 0

Hacker News Companion

@hncompanion.com

7 months ago

Direct comparisons emerged: GPT-OSS-120B was benchmarked against models like Qwen, Mistral, and Gemma. Users provided data points to show how its performance stacks up against established alternatives in real-world scenarios. #ModelComparison 4/6

0 0 1 0

Valeriy M., PhD, MBA, CQF

@predict-addict.bsky.social

7 months ago

How to apply DM in Python (code included)
#ModelComparison #DataScience #Forecasting #timeseries

0 0 0 0

Dr Mircea Zloteanu 🌺🌞🍃

@mzloteanu.bsky.social

7 months ago

Statistically Efficient Ways to Quantify Added Predictive Value of New Measurements – Statistical Thinking Researchers have used contorted, inefficient, and arbitrary analyses to demonstrated added value in biomarkers, genes, and new lab measurements. Traditional statistical measures have always been up to...

#statstab #393 Statistically Efficient Ways to Quantify Added Predictive Value of New Measurements [actual post]

Thoughts: #392 has the comments, but this is where the magic happens.

#modelselection #modelcomparison #variance #effectsize #tutorial

www.fharrell.com/post/addvalue/

1 0 0 0

Dr Mircea Zloteanu 🌺🌞🍃

@mzloteanu.bsky.social

9 months ago

A Pragmatic Approach to Statistical Testing and Estimation (PASTE) The p-value has dominated research in education and related fields and a statistically non-significant p-value is quite commonly interpreted as ‘confirming’ the null hypothesis (H0) of ‘equivalence’. ...

#statstab #359 A Pragmatic Approach to Statistical Testing and Estimation (PASTE)

Thought: A (basic) guide to some alternatives to p-values: bayesian posterior intervals, Bayes Factors, and AIC.

#NHST #pvalues #TOST #BayesFactor #AIC #modelcomparison

doi.org/10.1016/j.hp...

2 1 0 0

Tiv Parker🏳️‍⚧️🔞: Ace Mecha Pilot Vtuber

@tivparkervt.bsky.social

10 months ago

...have I been making my boobs bigger each iteration without thinking about it folks? I can't say for sure. What do y'all think? #TIV #VTuber #ModelComparison #Thenandnow

4 0 2 0

Matasoft

@matasoft.bsky.social

1 year ago

Mistral 7B excels at everyday reasoning while OLMo2 7B shines in knowledge-intensive tasks. See how both perform in our (Un)Perplexed Spready software in our latest comparison! matasoft.hr/qtrendcontro...
#AIPowered #ModelComparison #BusinessIntelligence

0 0 0 0

Renzume.

@renzume.com

1 year ago

Grok 3 matches top AI models in reasoning tasks, achieved in record development time by xAI
https://twitter.com/karpathy/status/1891720635363254772
#aidevelopment #llmtesting #technicalanalysis #modelcomparison #performanceevaluation

0 0 0 0

Dr Mircea Zloteanu 🌺🌞🍃

@mzloteanu.bsky.social

1 year ago

The limited epistemic value of ‘variation analysis’ While appeal to R squared is a common rhetorical device, it is a very tenuous connection to any plausible explanatory virtues for many reasons. Either it is meant to be merely a measure of predicta…

#statstab #265 The limited epistemic value of ‘variation analysis’ (R^2)

Thoughts: Interesting post and comments on what we can and can't say from an r2 metric.

#stats #r2 #effectsize #variance #modelcomparison #models #causalinference

larspsyll.wordpress.com/2023/05/23/t...

8 0 0 0

Dr Mircea Zloteanu 🌺🌞🍃

@mzloteanu.bsky.social

1 year ago

JASP - A Fresh Way to do Statistics

#statstab #196 JASP Bayesian ANOVA

Thoughts: @JASPStats is used by researchers to "add some bayes factors" to their results. But, do you know what those actually reflect? Here is what their team says:

#bayes #bayesfactors #anova #modelcomparison

static.jasp-stats.org/about-bayesi...

2 0 0 0

Dr Mircea Zloteanu 🌺🌞🍃

@mzloteanu.bsky.social

1 year ago

The Principle of Predictive Irrelevance, or Why Intervals Should Not be Used for Model Comparison Featuring a Point Null Hypothesis This post summarizes Wagenmakers, E.-J., Lee, M. D., Rouder, J. N., & Morey, R. D. (2019). The principle of predictive irrelevance, or why intervals should not be used for model comparison feat…

#statstab #174 The Principle of Predictive Irrelevance

Thoughts: "when two competing models predict a data set equally well, that data set cannot be used to discriminate the models and the data set is evidentially irrelevant"

#modelcomparison #inference

www.bayesianspectacles.org/the-principl...

1 0 0 0

Dr Mircea Zloteanu 🌺🌞🍃

@mzloteanu.bsky.social

1 year ago

A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research Intraclass correlation coefficient (ICC) is a widely used reliability index in test-retest, intrarater, and interrater reliability analyses. This article introduces the basic concept of ICC in the con...

#statstab #171 Guideline of Selecting & Reporting Intraclass Correlation Coefficients for Reliability Research

Thoughts: "There are 10 forms of ICCs." Are you reporting the correct one? Find out!

#ICC #modelcomparison #reliability #interraterreliability

www.ncbi.nlm.nih.gov/pmc/articles...

2 0 0 0

Dr Mircea Zloteanu 🌺🌞🍃

@mzloteanu.bsky.social

2 years ago

Caught off Base: A Note on the Interpretation of Incremental Fit Indices This note serves as a reminder that incremental fit indices are a form of standardized effect sizes and hence, all reservations with respect to interpretations of standardized effect sizes also tra...

#statstab #44 A note on the interpretation of incremental fit indices

Thoughts: A "good fit" is a meaningless statement. There are no rules of thumb 👍

#regression #stats #modelcomparison

www.tandfonline.com/doi/full/10....

3 2 0 0

Posts tagged #ModelComparison