Lens-Turbo

Self-contained diffusers-layout snapshot of Microsoft's Lens-Turbo (distilled) text-to-image model, re-assembled for in-house (SceneWorks) use after Microsoft removed the original microsoft/Lens-Turbo repository from the Hub.

This is a repackage, not a retrain β€” every weight is byte-identical to a public upstream source (verified by tensor-level comparison against Comfy-Org's authentic redistribution):

Component Source License
transformer/ Lens-Turbo DiT, bf16 β€” from Comfy-Org/Lens (diffusion_models/lens_turbo_bf16.safetensors) MIT
text_encoder/ gpt-oss-20b (MXFP4), used encoder-only β€” from openai/gpt-oss-20b Apache-2.0
tokenizer/ gpt-oss tokenizer β€” from openai/gpt-oss-20b Apache-2.0
vae/ FLUX.2 VAE (AutoencoderKLFlux2) β€” from black-forest-labs/FLUX.2-dev FLUX.2-dev license

The text encoder is stock, frozen gpt-oss-20b and the VAE is stock FLUX.2-dev; only the transformer/ DiT is Lens-specific. The encoder + VAE are identical to those in SceneWorks/Lens β€” only the distilled transformer/ differs.

Layout

tokenizer/       tokenizer.json, tokenizer_config.json, special_tokens_map.json
text_encoder/    model-0000*-of-00002.safetensors (MXFP4), model.safetensors.index.json, config.json
transformer/     lens_turbo_bf16.safetensors
vae/             diffusion_pytorch_model.safetensors, config.json

Sampling defaults for the distilled model: 4 steps, guidance 1.0 (β‰ˆ no CFG).

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support