GELab-Zero-4B-preview-Sico-Evolution

A 4B GUI agent fine-tuned (LoRA) from the open-source GELab-Zero-4B-preview base model on Microsoft Edge and Copilot UI trajectories. It is built with our general-purpose GUI model evolution pipeline: an iterative mechanism that keeps lifting an agent's real task success rate round after round and transfers to any GUI app.

Highlights

From 39.8% to 82.9%: Sico-Evolution achieves an 82.9% Task Success Rate (TSR), a gain of 43.1 percentage points over the 39.8% base-model baseline.

Outperforms Closed-Source SOTAs: It edges out top proprietary giants like gpt-5.4 (79.7%), Claude-Opus-4.6 (81.3%), and claude-opus-4.7 (82.1%).

Vastly Exceeds Open-Source Models: It crushes leading competitors including kimi-k2.6 (62.6%) and UI-Venus-1.5-30B (61.0%).
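
The gaps in the highlights above are differences in percentage points, not relative improvements. As a minimal sketch, the arithmetic can be checked as follows. The scores are copied from the highlights; the helper names `point_gap` and `relative_gain` are hypothetical and not part of any released tooling.

```python
# Task Success Rates (percent) as quoted in the Highlights.
SICO_TSR = 82.9
OTHER_TSR = {
    "GELab-Zero-4B-preview (base)": 39.8,
    "gpt-5.4": 79.7,
    "Claude-Opus-4.6": 81.3,
    "claude-opus-4.7": 82.1,
    "kimi-k2.6": 62.6,
    "UI-Venus-1.5-30B": 61.0,
}


def point_gap(ours: float, theirs: float) -> float:
    """Absolute difference in percentage points (hypothetical helper)."""
    return round(ours - theirs, 1)


def relative_gain(ours: float, theirs: float) -> float:
    """Improvement relative to the other score, in percent (hypothetical helper)."""
    return round(100 * (ours - theirs) / theirs, 1)


# Gap between Sico-Evolution and every other model listed above.
gaps = {name: point_gap(SICO_TSR, score) for name, score in OTHER_TSR.items()}

for name, gap in gaps.items():
    print(f"{name}: +{gap} points")
```

The margin over the base model is 43.1 points, while the margin over the closest proprietary model, claude-opus-4.7, is 0.8 points. Relative to the 39.8% baseline, the same 43.1-point gain corresponds to a relative improvement of roughly 108%.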

Results

Edge / Copilot Test Cases: TSR

Safetensors · Model size: 4B params · Tensor type: BF16