Training-free fine-grained semantic segmentation in low-data regimes.
A physics-grounded benchmark for evaluating foundation models on Newtonian reasoning.
Tool-augmented VLLMs as generic CAD task solvers.
Evaluating LMMs on culturally diverse visual reasoning across 100 languages.
Applying advanced algorithms to depression detection on the GLOBEM dataset.