
Ask AI to reason through problems step by step, not just give an answer. This single technique dramatically improves accuracy on math, logic, analysis, and any task where the path matters as much as the result.
TLDR
Add "think step by step" or "walk me through your reasoning before giving a final answer." It forces the model to reason rather than guess, which significantly reduces errors on complex problems.
Identify tasks that benefit from reasoning
Not every prompt needs chain-of-thought. Use it for math, multi-step decisions, logic puzzles, root cause analysis, and any task where errors come from jumping to conclusions. Simple factual questions don't benefit.
Add a reasoning trigger
Include one of these phrases: "think step by step", "walk me through your reasoning", "show your work before giving a conclusion", or "explain each step, then give your final answer." Place it at the end of the prompt.
Front-load all context
Give the model everything it needs to reason over before asking for the reasoning. Constraints, numbers, background, and edge cases should all come before your request.
Ask for the answer last
Structure your prompt so the conclusion follows the reasoning: "First explain... then give your recommendation." If the answer comes first, the model may rationalize a guess rather than actually reason.
Read the chain, not just the answer
Chain-of-thought's real power is that you can check each step. If a step is wrong, point to it: "In step 3 you assumed X, but actually Y. Re-reason from that point." This is far more precise than "try again."
Example prompt
Strategic trade-off analysis: a case where skipping the reasoning almost always leads to the wrong answer
A startup has $50,000 in runway. They're spending $12,000/month on salaries, $3,000 on infrastructure, and $1,500 on marketing. They have two options: cut one $8,000/month engineer to extend runway, or double the marketing budget to try closing a $30,000 customer deal. Think through the financial implications of each option step by step, including best-case and worst-case scenarios, then give a clear recommendation with your reasoning.
Math and calculations
Any time numbers are involved and accuracy matters. Chain-of-thought reduces arithmetic and logic errors by making each calculation visible and checkable.
Multi-step decisions
Trade-offs, pros/cons analysis, strategic choices where getting one step wrong cascades. The reasoning chain lets you spot and correct specific errors rather than re-doing the whole thing.
Root cause analysis
Asking AI to reason through a bug or problem step by step, checking one assumption at a time, produces far more reliable diagnoses than asking "why is this broken?"
Using it on simple tasks
Chain-of-thought makes responses longer. On simple factual questions it adds length without improving accuracy. Save it for tasks with genuine multi-step reasoning.
Only reading the conclusion
The chain IS the value. If you skip to the answer without checking the steps, you lose the ability to catch and correct errors mid-reasoning.
Vague setup
Chain-of-thought cannot improve reasoning over missing information. If your prompt is vague, you get a well-structured chain of incorrect reasoning.
On complex, multi-step problems, yes. On simple factual questions, it has no effect and just makes responses longer. The technique is most valuable where there is genuine reasoning to do.
Yes, but more capable models benefit more. GPT-4, Claude 3.5 Sonnet, and Gemini 1.5 Pro show the strongest improvements. On smaller models the reasoning chains are sometimes less reliable.
Absolutely. It pairs well with role prompting ("You are a financial analyst. Think step by step...") and few-shot prompting, where you show examples of good reasoning chains before your question.
Bottom line
Chain-of-thought prompting is the single most reliable technique for improving AI accuracy on complex problems. Add "think step by step" whenever the task involves multi-step reasoning, and read the chain, not just the answer.
Prompt packages that apply this technique directly.
Research
Research with AI works best when you treat it as a thinking partner, not a search engine.
See promptsCode Review
Most code reviews either miss real problems or create friction without adding value.
See promptsDebugging
Debugging is a skill that separates productive developers from frustrated ones.
See prompts