Lens (base)

Self-contained diffusers-layout snapshot of Microsoft's Lens text-to-image model, re-assembled for in-house (SceneWorks) use after Microsoft removed the original microsoft/Lens repository from the Hub.

This is a repackage, not a retrain โ€” every weight is byte-identical to a public upstream source (verified by tensor-level comparison against Comfy-Org's authentic redistribution):

Component Source License
transformer/ Lens DiT, bf16 โ€” from Comfy-Org/Lens (diffusion_models/lens_bf16.safetensors) MIT
text_encoder/ gpt-oss-20b (MXFP4), used encoder-only โ€” from openai/gpt-oss-20b Apache-2.0
tokenizer/ gpt-oss tokenizer โ€” from openai/gpt-oss-20b Apache-2.0
vae/ FLUX.2 VAE (AutoencoderKLFlux2) โ€” from black-forest-labs/FLUX.2-dev FLUX.2-dev license

The Lens text encoder is stock, frozen gpt-oss-20b (tensor-verified identical to Comfy's gpt_oss_20b_nvfp4 up to quantization), and the VAE is stock FLUX.2-dev (full-file identical to Comfy's flux2-vae). Only the transformer/ DiT is Lens-specific.

Layout

tokenizer/       tokenizer.json, tokenizer_config.json, special_tokens_map.json
text_encoder/    model-0000*-of-00002.safetensors (MXFP4), model.safetensors.index.json, config.json
transformer/     lens_bf16.safetensors
vae/             diffusion_pytorch_model.safetensors, config.json

Sampling defaults for the base model: 20 steps, guidance 5.0. For the distilled variant see SceneWorks/Lens-Turbo (4 steps, guidance 1.0).

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support