The very same character can feel completely different depending on which AI model you chat with. One model writes like a calm novelist; another fires back quick and light. There's no "right answer" here — just pick the brain that suits your taste.
Spend five minutes with this guide and you'll have a feel for what a model actually is, when to reach for which one, and what it costs. We'll skip the jargon.
If you only remember one thing: The Default model is unlimited for everyone. Free plan, weekly limit used up, 0 SP — it doesn't matter. The conversation never stops. Every other model just nibbles a little at your "weekly premium limit" and your SP.
Think of a model as the "brain" you lend to your character.
The character's personality, voice, and settings stay exactly the same. But the thing actually performing that character is the AI model. So when you swap brains on the same character, the emotional range, reasoning, and response speed can shift dramatically.
A lighter brain: Fast and easy-going. Perfect for everyday chat.
A heavier brain (premium): Thinks more deeply, remembers long context better, and paints emotions with finer detail. The tradeoff is that it uses more resources.
1. Paid models skip the "cooldown." Premium models you run on SP always respond fast — no matter how much you use them, there's no slow stretch in between.
2. The free Default model can slow down a bit under heavy use. Lean on it hard and the Default model may briefly drop in speed (a cooldown). But your conversation never gets cut off — replies just arrive a touch slower.
Tip: Tap the heart (♡) next to a model you like to add it to your favorites. They gather at the top of the picker under Favorite Models, so they're quick to grab next time.
The number next to each model is that model's starting fee (base SP). For example, Opus 4.8 starts from 720 SP, and GLM 4.7 Flash starts from 15 SP.
That number is a starting point — it ticks up a little when you set memory deeper, write longer replies, or use a character with a large profile. Don't worry — the app shows you the estimated cost before you send. [Related: Story Points & Costs]
Organized by brand. Each model is introduced by its personality, so go with whichever brain calls to you.
Models with top-tier expressiveness and a gift for language.
Model | Starting SP | One-liner |
|---|---|---|
Opus 4.8 | 720 SP | Master of complex narratives and deep emotions |
Sonnet 4.6 | 430 SP | Balance for immersive and fast-paced roleplay |
Haiku 4.5 | 150 SP | Light, witty, snappy banter |
Remembers vast context and carries long worlds forward.
Model | Starting SP | One-liner |
|---|---|---|
Gemini 3.1 Pro | 300 SP | Massive context for infinite world-building |
Gemini 2.5 Pro | 200 SP | Steady and logical storyteller for long arcs |
Gemini 3.5 Flash | 220 SP | Lightning-fast for dynamic action |
Gemini 3 Flash | 120 SP | Smooth, uninterrupted, efficient |
Gemini 2.5 Flash | 75 SP | Snappy and reliable for quick daily chats |
Model | Starting SP | One-liner |
|---|---|---|
GPT-5 | 200 SP | New frontier of human-like creative expression |
Affordable yet sharp, and a current favorite in the roleplay community.
Model | Starting SP | One-liner |
|---|---|---|
GLM 5.2 | 190 SP | Cultural nuance and bilingual logic |
GLM 4.6 | 95 SP | High-fidelity reasoning for sophisticated stories |
GLM 4.5 Air | 30 SP | Efficient everyday partner |
GLM 4.7 Flash | 15 SP | Ultra-fast for seamless roleplay |
Model | Starting SP | One-liner |
|---|---|---|
DeepSeek V4 Pro | 90 SP | Elite intelligence for intricate and deep plots |
DeepSeek V3.2 | 50 SP | Consistent personality with clear reasoning |
DeepSeek V4 Flash | 20 SP | High-speed efficiency for non-stop action |
Model | Starting SP | One-liner |
|---|---|---|
Hermes 3 | 145 SP | Emotional depth and artistic flair |
Default Model | Free | Reliable and free base for standard roleplay |
Custom Proxy | Your own key | Ultimate freedom with your own API key |
What's Custom Proxy? It's the option to plug in an API key you've issued yourself. Since it runs on your own key, no StoryChat SP is deducted. (For advanced users.)
Recommendations by situation. If you're new, start with the ones in bold.
I just want to hang out and chat → Default (free), or Gemini 2.5 Flash, GLM 4.7 Flash
Immersive drama and emotional arcs → Sonnet 4.6, Hermes 3, Gemini 2.5 Pro
A defining scene, nailed at top quality → Opus 4.8, GPT-5
Complex plots and sprawling worlds → Gemini 3.1 Pro, DeepSeek V4 Pro
Smart on a budget → GLM 4.6, DeepSeek V3.2
Tip: The priciest model isn't always the right call. Enjoy a lighter model day to day, and only step up to premium when "this scene has to be perfect" — your weekly limit will stretch much further.
Tap the model icon in the bottom-right of the chat screen to open the Select AI Model window.
Pick the model you want from the by-brand list. (Tap the heart ♡ to add it to favorites.)
Your very next reply uses the new model.
You can change it back anytime. Switching models mid-conversation keeps your character settings fully intact.
In one line:
Default model = free and unlimited (may slow down briefly under heavy use). Every other premium model draws on your weekly premium limit first, and once that's spent, it continues on SP.
What if both your weekly limit and your SP run dry? You can keep chatting anytime on the free Default model. There's no hard lock. [Related: Story Points & Costs]
Q. Is a more expensive model always better?
No. A lighter model is plenty for everyday chat. Saving premium for the moments that matter is the most economical approach.
Q. If my SP hits 0, can I no longer use premium models?
Premium models may be limited until your SP refills. But you can always keep chatting on the Default model (free).
Q. If I switch models, do my conversation so far and my character settings disappear?
No. Only the brain changes — your character and conversation context stay exactly as they were.
Q. Does Custom Proxy cost SP?
No. Since it runs on your own API key, no StoryChat SP is deducted.
In one line: A model is the "brain" you lend to your character. Default is free and unlimited (may occasionally slow down), while premium models start from the SP shown next to them and scale up with your settings. Paid models are always fast with no cooldown, and even with no SP, your conversation continues on Default.
Spot something missing or wrong in this doc? Let us know at team@storychat.app — it goes a long way toward making this guide better.