| Synthetic offers a subscription-based alternative to pay-per-token AI pricing. Instead of tracking usage costs, you get unlimited access to all their "always-on" models for a flat monthly fee. What's included: - 19 always-on models with both UI and API access - LoRA fine-tuning support (FP8 precision, up to rank-64) - Embedding models at no extra cost - Standard: $20/month (135 msgs/5hrs), Pro: $60/month (1,350 msgs/5hrs) Technical specs: - Always-on models: No quantization (full precision) - On-demand models: BF16 precision (FP8 for Jamba-based models only) - LoRAs: FP8 precision, rank-8 to rank-64 support - On-demand GPU pricing: 80GB at 3¢/min, 48GB at 1.5¢/min, 24GB at 1.2¢/min - On-demand context limit: 32k tokens Complete always-on model list: DeepSeek: R1, R1-0528, V3, V3-0324, V3.1 (all 128k) Meta Llama: 3.1-405B/70B/8B, 3.3-70B (128k), 4-Maverick-17B (524k), 4-Scout-17B (328k) Others: Kimi-K2 (128k/256k), GPT-OSS-120B (128k), Qwen2.5-Coder-32B (32k), Qwen3-235B variants (256k), Qwen3-Coder-480B (256k), GLM-4.5 (128k) Additional features: - LoRA support for Llama 3.1/3.2 base models - Embedding model: nomic-ai/nomic-embed-text-v1.5 - Any HuggingFace model available on-demand Links: https://synthetic.new/ | With referral: https://synthetic.new/?referral=9oxapskWLeOrDT5 |