Sakana AI

company

Verified

https://sakana.ai/

AI & ML interests

We are a Tokyo-based R&D company on a quest to create a new kind of foundational AI model based on nature-inspired intelligence.

Recent Activity

LayneH submitted a paper 1 day ago

How Good Can Linear Models Be for Time-Series Forecasting?

tksii authored a paper 4 days ago

CoffeeBench: Benchmarking Long-Horizon LLM Agents in Heterogeneous Multi-Agent Economies

Papers

How Good Can Linear Models Be for Time-Series Forecasting?

Fast-weight Product Key Memory

submitted 2 papers to Daily Papers 1 day ago

How Good Can Linear Models Be for Time-Series Forecasting?

Paper • 2606.27282 • Published 6 days ago • 7

How Good Can Linear Models Be for Time-Series Forecasting?

Paper • 2606.27282 • Published 6 days ago • 7

authored a paper 4 days ago

CoffeeBench: Benchmarking Long-Horizon LLM Agents in Heterogeneous Multi-Agent Economies

Paper • 2606.16613 • Published 16 days ago • 9

submitted a paper to Daily Papers 5 days ago

CoffeeBench: Benchmarking Long-Horizon LLM Agents in Heterogeneous Multi-Agent Economies

Paper • 2606.16613 • Published 16 days ago • 9

authored a paper 6 days ago

CoffeeBench: Benchmarking Long-Horizon LLM Agents in Heterogeneous Multi-Agent Economies

Paper • 2606.16613 • Published 16 days ago • 9

authored 3 papers 20 days ago

Mitigating Reward Hacking in RLHF via Advantage Sign Robustness

Paper • 2604.02986 • Published Apr 3 • 3

LLM Routing with Dueling Feedback

Paper • 2510.00841 • Published Oct 1, 2025

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

Paper • 2606.07379 • Published 26 days ago • 5

updated a dataset 27 days ago

SakanaAI/sudoku-bench-nikoli

Viewer • Updated 27 days ago • 100 • 73

published a dataset 27 days ago

SakanaAI/sudoku-bench-nikoli

Viewer • Updated 27 days ago • 100 • 73

authored a paper 29 days ago

HakushoBench: A Japanese Chart and Table VQA Benchmark from Governmental White Papers

Paper • 2606.01132 • Published May 31 • 6

submitted a paper to Daily Papers 29 days ago

HakushoBench: A Japanese Chart and Table VQA Benchmark from Governmental White Papers

Paper • 2606.01132 • Published May 31 • 6

updated a dataset about 1 month ago

SakanaAI/lm-wheels

Updated May 28 • 59 • 1

updated a Space about 1 month ago

Llama 3 Karamaru V1

Classical Japanese Chatbot

updated a dataset about 2 months ago

SakanaAI/KamonBench

Updated May 14 • 32 • 1

published a dataset about 2 months ago

SakanaAI/KamonBench

Updated May 14 • 32 • 1

updated a dataset about 2 months ago

SakanaAI/FishMath-SFT-Data

Viewer • Updated May 8 • 23.3k • 76 • 3

published a dataset about 2 months ago

SakanaAI/FishMath-SFT-Data

Viewer • Updated May 8 • 23.3k • 76 • 3

published a model about 2 months ago

SakanaAI/gpt-oss-120b-sft-aimo3-fishmath

Text Generation • 117B • Updated May 8 • 6

updated a model about 2 months ago

SakanaAI/gpt-oss-120b-sft-aimo3-fishmath

Text Generation • 117B • Updated May 8 • 6