Model optimization
Gemini Flash compression
Gemini 1.5 Flash offers a 1M token context window at competitive pricing. The large context window invites bloat — compression keeps costs down.
Flash context costs
Gemini 1.5 Flash costs $0.075/1M input tokens for contexts up to 128K tokens, and $0.15/1M for longer contexts. With a 100K-token context, that is $0.0075 per query. Compression to 35K tokens drops this to $0.0026 per query.
Frequently asked questions
Does the large context window make compression less important?
The opposite — a 1M window invites more context, making compression more valuable.
Does compression work with Gemini's multimodal inputs?
Compression applies to the text portions of multimodal inputs.
Try it yourself
Paste your long prompt into the playground, ask a question, and see what SuperCompress keeps and removes. Free, no signup needed.