https://github.com/davdittrich/robscale
#RStats #RobustStatistics #DataScience #Optimization
orig https://fediscience.org/@davdittrich/116201886682143700 4/4
Latest posts tagged with #RobustStatistics on Bluesky
https://github.com/davdittrich/robscale
#RStats #RobustStatistics #DataScience #Optimization
orig https://fediscience.org/@davdittrich/116201886682143700 4/4
A two-panel benchmark charting performance multipliers of the optimized C++ robscale package against legacy pure-R implementations across sample sizes from $n = 3$ up to $10^7$, with the vertical axis starting honestly at 0x. The left panel reveals a massive speedup for M-estimators (robLoc, robScale, adm vs. revss), pushing up to ~28x for robScale. The right panel tracks scale estimators ($Q_n$, $S_n$ vs. robustbase), showing the speedup curve upward from 1.6x, approaching 10x at large sample sizes. Shaded ribbons show 95% bootstrap confidence intervals, visually confirming dramatic computational efficiency.
Robust estimation demands highly efficient computation, especially in streaming anomaly detection where latency budgets are tight.
While Rousseeuw & Croux's robust estimators ($Q_n$ and $S_n$), and Rousseeuw & Verboven's M-estimators of location and scale […]
[Original post on fediscience.org]