| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by pulkas 289 days ago

Synthetic offers a subscription-based alternative to pay-per-token AI pricing. Instead of tracking usage costs, you get unlimited access to all their "always-on" models for a flat monthly fee.

What's included:

- 19 always-on models with both UI and API access

- LoRA fine-tuning support (FP8 precision, up to rank-64)

- Embedding models at no extra cost

- Standard: $20/month (135 msgs/5hrs), Pro: $60/month (1,350 msgs/5hrs)

Technical specs:

- Always-on models: No quantization (full precision)

- On-demand models: BF16 precision (FP8 for Jamba-based models only)

- LoRAs: FP8 precision, rank-8 to rank-64 support

- On-demand GPU pricing: 80GB at 3¢/min, 48GB at 1.5¢/min, 24GB at 1.2¢/min

- On-demand context limit: 32k tokens

Complete always-on model list:

DeepSeek: R1, R1-0528, V3, V3-0324, V3.1 (all 128k)

Meta Llama: 3.1-405B/70B/8B, 3.3-70B (128k), 4-Maverick-17B (524k), 4-Scout-17B (328k)

Others: Kimi-K2 (128k/256k), GPT-OSS-120B (128k), Qwen2.5-Coder-32B (32k), Qwen3-235B variants (256k), Qwen3-Coder-480B (256k), GLM-4.5 (128k)

Additional features:

- LoRA support for Llama 3.1/3.2 base models

- Embedding model: nomic-ai/nomic-embed-text-v1.5

- Any HuggingFace model available on-demand

Links: https://synthetic.new/ | With referral: https://synthetic.new/?referral=9oxapskWLeOrDT5