UniPat AI
Popular repositories Loading
-
BabyVision
BabyVision PublicWe introduce BabyVision, a benchmark revealing the infancy of AI vision.
-
UniScientist
UniScientist PublicUniScientist is designed to advance universal scientific research intelligence through a unified paradigm
-
-
SaaS-Bench
SaaS-Bench PublicOfficial repository for SaaS-Bench: realistic, locally deployable SaaS workflows for GUI agent evaluation.
-
RoadmapBench
RoadmapBench PublicEvaluating Long-Horizon Agentic Software Development Across Version Upgrades — 115 tasks, 17 repos, 5 languages
Python 8
-
harbor
harbor PublicForked from harbor-framework/harbor
Harbor is a framework for running agent evaluations and creating and using RL environments.
Python 1
Repositories
- EvoCodeBench Public
UniPat-AI/EvoCodeBench’s past year of commit activity - harbor_multiturn Public
UniPat-AI/harbor_multiturn’s past year of commit activity - RoadmapBench Public
Evaluating Long-Horizon Agentic Software Development Across Version Upgrades — 115 tasks, 17 repos, 5 languages
UniPat-AI/RoadmapBench’s past year of commit activity - SaaS-Bench Public
Official repository for SaaS-Bench: realistic, locally deployable SaaS workflows for GUI agent evaluation.
UniPat-AI/SaaS-Bench’s past year of commit activity - harbor Public Forked from harbor-framework/harbor
Harbor is a framework for running agent evaluations and creating and using RL environments.
UniPat-AI/harbor’s past year of commit activity - SWE-Vision Public
UniPat-AI/SWE-Vision’s past year of commit activity - UniScientist Public
UniScientist is designed to advance universal scientific research intelligence through a unified paradigm
UniPat-AI/UniScientist’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…