YouTube entry: Rosettia video

rosettia-quy-gspo-nllb13b-merged (Spanish → Chanka/Ayacucho Quechua)

Standalone merged NLLB-200-1.3B fine-tuned with GSPO reinforcement learning for spa→quy. This is facebook/nllb-200-1.3B with the GSPO LoRA merged in — load directly, no PEFT needed.

ChrF 45.53 (w0), standalone, AmericasNLP 2021 spa→quy test. Full results, figures, scorecard, methodology, and honest caveats (which number is pre-registered vs best-found) are in the adapter repo: https://huggingface.co/Thermostatic/rosettia-quy-gspo-nllb13b-lora

Usage

import torch
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
tok = AutoTokenizer.from_pretrained("Thermostatic/rosettia-quy-gspo-nllb13b-merged", src_lang="spa_Latn", tgt_lang="quy_Latn")
m = AutoModelForSeq2SeqLM.from_pretrained("Thermostatic/rosettia-quy-gspo-nllb13b-merged", torch_dtype=torch.bfloat16).cuda().eval()
bos = tok.convert_tokens_to_ids("quy_Latn")
enc = tok("No se por que sucedio eso.", return_tensors="pt").to("cuda")
print(tok.batch_decode(m.generate(**enc, forced_bos_token_id=bos, num_beams=5, max_new_tokens=128), skip_special_tokens=True)[0])

Research-grade; single-benchmark single-reference ChrF; no human evaluation. See the adapter repo for limitations.

Authors & contributions

A two-person SomosNLP hackathon project:

Estefanía Espinosa Fernández — data curation, and the initial Qwen3.5 LoRA experiments (comparing DoRA, rsLoRA and LoRA, and exploring data mixes).
Irving Ernesto Quezada Ramírez (irvingernesto.com) — the subsequent modeling through the final system: synthetic-data distillation, the NLLB pipeline, GSPO reinforcement learning, decoding/ensembling, evaluation, and release.

The project was a close collaboration; both contributions were essential to the result.

Compute & funding

All compute was self-funded out of pocket by Irving Ernesto Quezada Ramírez. The project used roughly ~90 hours of L40S and ~90 hours of A100 GPU time in total, the overwhelming majority of it on experimentation. Training the important models took about ~8 hours of A100 time, plus roughly ~4 hours of A100 for vLLM inference. All told, the whole project cost about ~$100.

Downloads last month: 64

Safetensors

Model size

1B params

Tensor type

BF16

Model tree for somosnlp-hackathon-2026/rosettia-quy-gspo-nllb13b-merged

Base model

facebook/nllb-200-1.3B

Finetuned

(29)

this model

somosnlp-hackathon-2026
/

rosettia-quy-gspo-nllb13b-merged

rosettia-quy-gspo-nllb13b-merged (Spanish → Chanka/Ayacucho Quechua)

Usage

Authors & contributions

Links

Compute & funding

Model tree for somosnlp-hackathon-2026/rosettia-quy-gspo-nllb13b-merged