Streaming & Models

Real-Time Streaming

NeoCash streams AI responses in real time as they are generated. Instead of waiting for the complete response to appear all at once, you see the text arrive word by word. This provides several benefits:

  • Faster perceived response time — You begin reading useful information within seconds, even for complex analyses
  • Early course correction — If the response is heading in the wrong direction, you can see it immediately and refine your question
  • Natural reading pace — The streaming speed roughly matches comfortable reading speed, so you stay engaged with the content as it arrives

During streaming, a subtle indicator shows that the AI is still generating. You can scroll up to review earlier parts of the response while new content continues to appear at the bottom.
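
As a rough illustration of the behavior described above, the sketch below simulates a response arriving in chunks and renders each chunk as soon as it arrives. The names `fake_stream` and `render_stream` are hypothetical stand-ins chosen for this example; they are not part of NeoCash, and the real implementation may differ.

```python
import time
from typing import Iterable, Iterator


def fake_stream(text: str, delay: float = 0.0) -> Iterator[str]:
    """Hypothetical stand-in for a streamed AI response: yields one word at a time."""
    for word in text.split(" "):
        time.sleep(delay)  # simulated generation latency per chunk
        yield word + " "


def render_stream(chunks: Iterable[str]) -> str:
    """Print each chunk as soon as it arrives and return the full text."""
    received = []
    for chunk in chunks:
        print(chunk, end="", flush=True)  # the reader sees text immediately
        received.append(chunk)
    print()  # finish the line once the stream ends
    return "".join(received).rstrip()


if __name__ == "__main__":
    render_stream(fake_stream("Your savings rate rose this quarter.", delay=0.05))
```

Because each chunk is printed the moment it is yielded, the first words are visible long before the last one has been generated, which is the effect behind the faster perceived response time.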

The Model Selector

NeoCash lets you choose which Claude model powers your conversation. The model selector sits near the message input area and offers three options:

  • Claude Sonnet — The balanced default
  • Claude Haiku — The speed-optimized model
  • Claude Opus — The depth-optimized model

Changing the model applies to the next message you send. You can switch models mid-conversation if your needs change — for instance, starting with Haiku for quick questions and switching to Opus when you need deep analysis.
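
One way to picture this behavior is a conversation object that holds a single shared history plus a "currently selected" model that is read only when a message is sent. The following is a minimal sketch under that assumption; `Conversation`, `Turn`, and the placeholder reply are hypothetical and do not describe NeoCash's internals.

```python
from dataclasses import dataclass, field
from typing import List

MODELS = ("sonnet", "haiku", "opus")  # the three options listed above


@dataclass
class Turn:
    role: str   # "user" or "assistant"
    text: str
    model: str  # model that was selected when this turn was sent or generated


@dataclass
class Conversation:
    selected_model: str = "sonnet"  # Sonnet is the default
    history: List[Turn] = field(default_factory=list)

    def select(self, model: str) -> None:
        """Change the model; only messages sent afterwards are affected."""
        if model not in MODELS:
            raise ValueError(f"unknown model: {model}")
        self.selected_model = model

    def send(self, text: str) -> Turn:
        """Send a message using whichever model is selected right now."""
        self.history.append(Turn("user", text, self.selected_model))
        # A real client would pass the full history to the model here;
        # this placeholder reply only records which model would answer.
        reply = Turn("assistant", f"[{self.selected_model} reply]", self.selected_model)
        self.history.append(reply)
        return reply


if __name__ == "__main__":
    chat = Conversation()
    chat.select("haiku")
    chat.send("What is an index fund?")
    chat.select("opus")
    chat.send("Compare Roth conversion strategies for my accounts.")
    print([turn.model for turn in chat.history])
```

Earlier turns keep the model they were produced with, while the history itself is never reset when the selection changes.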

Claude Sonnet: The Balanced Choice

Sonnet is the default model and the right choice for most interactions. It provides strong analytical capability with responsive speed, making it ideal for everyday financial questions and planning tasks.

Best for

  • General financial Q&A
  • Goal planning and progress reviews
  • Budget analysis and spending breakdowns
  • Document summaries and key figure extraction
  • Most day-to-day wealth management conversations

Characteristics

  • Responds quickly while maintaining high-quality reasoning
  • Handles multi-step financial calculations accurately
  • Provides well-structured, comprehensive answers
  • Balances detail with conciseness

Claude Haiku: Speed First

Haiku is designed for speed. It returns responses significantly faster than Sonnet, making it ideal for quick lookups, simple calculations, and situations where you need rapid answers without deep analysis.

Best for

  • Quick factual questions (“What is the 2026 401(k) contribution limit?”)
  • Simple calculations and conversions
  • Short clarifications on financial terms
  • Rapid brainstorming or list generation
  • Situations where you are iterating quickly and need fast feedback

Characteristics

  • Fastest response times of all three models
  • Concise, direct answers
  • Well-suited for straightforward tasks
  • Lower cost per message, which matters at scale

Tradeoffs

Haiku trades some analytical depth for speed. For complex multi-factor analyses, retirement projections, or nuanced tax strategy, Sonnet or Opus will provide more thorough reasoning.

Claude Opus: Maximum Depth

Opus is the most capable model available. It excels at complex reasoning, multi-step analysis, and situations that require synthesizing large amounts of information. When you need the AI to think deeply about your financial situation, Opus is the right choice.

Best for

  • Complex tax optimization strategies across multiple accounts
  • Comprehensive retirement planning with multiple variables
  • Detailed investment analysis comparing many options
  • Situations where extended thinking mode is valuable
  • Cross-referencing multiple uploaded documents
  • Scenarios requiring nuanced judgment about tradeoffs

Characteristics

  • Deepest analytical reasoning
  • Best at handling complex, multi-part questions
  • Most thorough exploration of edge cases and considerations
  • Pairs well with extended thinking mode for transparency into its reasoning process

Tradeoffs

Opus takes longer to respond than Sonnet or Haiku. The additional time is spent on more thorough reasoning, which is valuable for complex tasks but unnecessary for simple queries.

Choosing the Right Model

A practical framework for model selection:

Situation                             Recommended Model
Quick question with a known answer    Haiku
General financial planning            Sonnet
Analyzing an uploaded document        Sonnet
Complex multi-year tax strategy       Opus
Comparing retirement scenarios        Opus
Checking a single calculation         Haiku
First draft of a financial plan       Sonnet
Deep review of a financial plan       Opus
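
The framework above can also be read as a simple lookup. The sketch below encodes the same eight rows; the `recommend` helper and its fallback to Sonnet for unlisted situations are assumptions made for illustration, based only on Sonnet being described as the default model.

```python
# Situation -> recommended model, copied from the framework above.
RECOMMENDED_MODEL = {
    "Quick question with a known answer": "Haiku",
    "General financial planning": "Sonnet",
    "Analyzing an uploaded document": "Sonnet",
    "Complex multi-year tax strategy": "Opus",
    "Comparing retirement scenarios": "Opus",
    "Checking a single calculation": "Haiku",
    "First draft of a financial plan": "Sonnet",
    "Deep review of a financial plan": "Opus",
}


def recommend(situation: str, default: str = "Sonnet") -> str:
    """Return the recommended model, falling back to the default (assumed: Sonnet)."""
    return RECOMMENDED_MODEL.get(situation, default)


if __name__ == "__main__":
    print(recommend("Checking a single calculation"))
    print(recommend("Something not in the table"))
```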

You can change models at any point in a conversation. The newly selected model receives the full context of earlier messages, regardless of which model generated them, so switching carries no penalty.

Streaming and Tool Calls

When the AI uses tools during streaming (such as running a financial calculation or performing web research), the stream may pause briefly while the tool executes. You will see an indicator showing which tool is running. Once the tool returns its result, streaming resumes with the AI incorporating the tool’s output into its response. See the Tool Calls documentation for more detail on this process.
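
The pause-and-resume sequence described above can be sketched as a stream of events: text, a tool start, a tool end, then more text that uses the tool's result. Everything in this example is hypothetical, including the event names (`text`, `tool_start`, `tool_end`) and the `compound_interest` tool; it illustrates the ordering only, not NeoCash's actual event format.

```python
from typing import Callable, Dict, Iterator, List, Tuple

Event = Tuple[str, str]  # (kind, payload)


def compound_interest(principal: float, rate: float, years: int) -> float:
    """Example tool: future value with annual compounding."""
    return principal * (1 + rate) ** years


def simulated_response(tools: Dict[str, Callable[..., float]]) -> Iterator[Event]:
    """Hypothetical stream: text, then a tool call, then text that uses the result."""
    yield ("text", "Let me run the numbers. ")
    yield ("tool_start", "compound_interest")  # text output pauses here
    value = tools["compound_interest"](1000.0, 0.05, 10)
    yield ("tool_end", "compound_interest")    # the tool has returned
    yield ("text", f"After 10 years the balance would be ${value:,.2f}.")


def display(events: Iterator[Event]) -> List[str]:
    """Turn stream events into the items a user would see, in order."""
    shown: List[str] = []
    for kind, payload in events:
        if kind == "text":
            shown.append(payload)
        elif kind == "tool_start":
            shown.append(f"[running tool: {payload}]")  # indicator while paused
        elif kind == "tool_end":
            shown.append(f"[tool finished: {payload}]")
    return shown


if __name__ == "__main__":
    for item in display(simulated_response({"compound_interest": compound_interest})):
        print(item)
```

The text that follows the tool call is produced only after the tool returns, which is why the visible stream stalls briefly while the indicator is shown.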