FungiTastic Baseline

Training-free fine-grained semantic segmentation in low-data regimes.

NewtPhys

A physics-grounded benchmark for evaluating foundation models on Newtonian reasoning.

CAD-Assistant

Tool-augmented VLLMs as generic CAD task solvers.

All Languages Matter

Evaluating LMMs on culturally diverse visual reasoning across 100 languages.

Does Complexity Pay Off?

Applying advanced algorithms to depression detection on the GLOBEM dataset.