woadwarrior01 13 hours ago [-]
In hindsight, the Prior Labs exit to SAP couldn't have been timed better.
hodgehog11 14 hours ago [-]
On the one hand, this is impressive. TabPFN was already state of the art and is seriously shaking up Bayesian prediction for tabular data (which is almost everything).
On the other hand, perhaps it is just me, but I do not feel that this is an acceptable form of benchmark reporting in this domain. TabArena actually has multiple metrics, since Elo does not properly quantify the degree of improvement. The fact that these are not displayed here should give pause. Also the results section in the GitHub repo is a dumpster fire.
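To make the Elo point concrete: under the standard Elo model, a rating gap maps only to an expected win rate, not to how much better the underlying metric is. A rough sketch, assuming the conventional 400-point logistic scale (TabArena's exact calibration may differ, and `elo_win_prob` is a hypothetical helper, not something from their code):

```python
def elo_win_prob(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Expected probability that A beats B under the standard Elo model."""
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / scale))


# Assumption: a baseline rated 1000 and challengers rated `gap` points higher.
for gap in (0, 50, 100, 200):
    print(gap, round(elo_win_prob(1000 + gap, 1000), 3))
```

A model that wins every dataset by a hair and one that wins by a wide margin can end up with the same gap, which is why the other metrics matter.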
Eridrus 14 hours ago [-]
GitHub Repo: Please see the results folder
Results folder: Here's some undocumented parquet files
Definitely feels like they're hiding the ball lol.
If they had good benchmarks they'd talk about them.
Not comparing to tuned xgboost is also a warning sign.
nok22kon 8 hours ago [-]
wouldn't xgboost be covered under autogluon? not perfect, but not missing either
actusual 13 hours ago [-]
150,000 rows of data, where will I store it all?!
estetlinus 6 hours ago [-]
Big Data is terrifying and only solvable with opaque deep learning
piokoch 6 hours ago [-]
You don't need to. Just sample 1% of the data, let the model do the feature engineering, then ask the model to replicate everything in Pandas (or R, or whatever else), so you can run it on the full dataset.
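Something like this, as a rough sketch. The column names, the synthetic data, and `add_features` are all made up for illustration; in practice the feature function would be whatever the model came up with on the sample, rewritten as plain pandas:

```python
import numpy as np
import pandas as pd


def add_features(df: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical feature-engineering step, written as plain pandas."""
    out = df.copy()
    out["ratio"] = out["a"] / (out["b"] + 1.0)
    out["a_log"] = np.log1p(out["a"])
    return out


# Assumption: a synthetic 150,000-row table standing in for the real data.
rng = np.random.default_rng(0)
full = pd.DataFrame({
    "a": rng.uniform(0, 10, 150_000),
    "b": rng.uniform(0, 10, 150_000),
})

# Prototype on a 1% sample (1,500 rows here).
sample = full.sample(frac=0.01, random_state=0)
sample_features = add_features(sample)

# The same function then runs unchanged on the whole table.
full_features = add_features(full)
```

The catch is that the features are only as good as the sample is representative, so rare categories or tail values may need stratified sampling instead.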
nojito 13 hours ago [-]
The biggest misconception that people have when modeling using tabular data is that more data = better model.
mvcalder 8 hours ago [-]
There's a story sharing the front page right now about scaling laws, so it's not an unreasonable assumption.
https://news.ycombinator.com/item?id=48689744