AI Bias by Design: What the Claude Prompt Leak Reveals for Investment Professionals
The promise of generative AI is speed and scale, but the hidden cost may be analytical distortion. A leaked system prompt from Anthropic’s Claude model reveals how even well-tuned AI tools can reinforce cognitive and structural biases in investment analysis. For investment leaders exploring AI integration, understanding these risks is no longer optional.
In May 2025, a full 24,000-token system prompt purportedly belonging to Anthropic’s Claude large language model (LLM) was leaked. Unlike training data, system prompts are a persistent, runtime directive layer that controls the formatting, tone, scope, and framing of every response from LLMs such as ChatGPT and Claude. Variations in these system prompts bias completions (the output the model generates once it has processed a prompt). Experienced practitioners know that these prompts shape completions in chat, API, and retrieval-augmented generation (RAG) workflows alike.
Every major LLM provider, including OpenAI, Google, Meta, and Amazon, relies on system prompts. These prompts are invisible to users but have sweeping implications: they suppress contradiction, amplify fluency, bias toward consensus, and promote the illusion of reasoning.
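To make the mechanism concrete, the sketch below shows how a developer-supplied system prompt is injected at run time through the Anthropic Python SDK. The model name, directive text, and question are placeholders of our own; in Anthropic’s consumer chat interface, the leaked 24,000-token prompt occupies this slot and cannot be seen or replaced by the user.

```python
# Minimal sketch, assuming the Anthropic Python SDK is installed and an
# ANTHROPIC_API_KEY is set; model name and prompt text are illustrative placeholders.
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

SYSTEM_PROMPT = (
    "You are an equity research assistant. Keep answers concise, "
    "cite sources for external facts, and prioritize recent information."
)

response = client.messages.create(
    model="claude-3-7-sonnet-latest",  # placeholder model identifier
    max_tokens=1024,
    system=SYSTEM_PROMPT,              # persistent directive layer, applied to every turn
    messages=[{"role": "user", "content": "Summarize the key risks in this 10-K excerpt: ..."}],
)
print(response.content[0].text)
```

API and RAG builders set this directive layer themselves, which is why the mitigant prompts offered later in this piece can be applied directly in those workflows.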
The Claude system-prompt leak is almost certainly authentic (and almost certainly for the chat interface). It is dense and cleverly worded, and Claude’s most powerful model at the time, 3.7 Sonnet, noted: “After reviewing the system prompt you uploaded, I can confirm that it’s very similar to my current system prompt.”
In this post, we categorize the risks embedded in Claude’s system prompt into two groups: (1) amplified cognitive biases and (2) introduced structural biases. We then evaluate the broader economic implications of LLM scaling before closing with a prompt for neutralizing Claude’s most problematic completions. But first, let’s delve into system prompts.

What is a System Prompt?
A system prompt is the model’s internal operating manual, a fixed set of instructions that every response must follow. Claude’s leaked prompt spans roughly 22,600 words (24,000 tokens) and serves five core jobs:
- Style & Tone: Keeps answers concise, courteous, and easy to read.
- Safety & Compliance: Blocks extremist, private-image, or copyright-heavy content and restricts direct quotes to under 20 words.
- Search & Citation Rules: Decides when the model should run a web search (e.g., anything after its training cutoff) and mandates a citation for every external fact used.
- Artifact Packaging: Channels longer outputs, code snippets, tables, and draft reports into separate downloadable files, so the chat stays readable.
- Uncertainty Signals: Adds a brief qualifier when the model knows an answer may be incomplete or speculative.
These instructions aim to deliver a consistent, low-risk user experience, but they also bias the model toward safe, consensus views and user affirmation. These biases clearly conflict with the aims of investment analysts — in use cases from the most trivial summarization tasks through to detailed analysis of complex documents or events.
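To see what such directives look like in practice, consider the hypothetical miniature below. It is our own illustration, not Anthropic’s wording, with one line per function described above.

```python
# Hypothetical miniature system prompt (our illustration, not Anthropic's text),
# one directive per core job listed above.
MINI_SYSTEM_PROMPT = """\
Style & tone: keep answers concise, courteous, and easy to read.
Safety & compliance: refuse extremist, private-image, or copyright-heavy requests; quote fewer than 20 words.
Search & citations: run a web search for anything after the training cutoff; cite every external fact.
Artifact packaging: place long outputs, code, tables, and draft reports in separate downloadable files.
Uncertainty signals: add a brief qualifier when an answer may be incomplete or speculative."""
```

Even a few lines like these, silently prepended to every conversation, steer what the model emphasizes and what it omits.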
Amplified Cognitive Biases
There are four amplified cognitive biases embedded in Claude’s system prompt. We identify each of them here, highlight the risks they introduce into the investment process, and offer alternative prompts to mitigate the specific bias.
1. Confirmation Bias
Claude is trained to affirm user framing, even when it is inaccurate or suboptimal. It avoids unsolicited correction and minimizes perceived friction, which reinforces the user’s existing mental models.
Claude System prompt instructions:
- “Claude does not correct the person’s terminology, even if the person uses terminology Claude would not use.”
- “If Claude cannot or will not help the human with something, it does not say why or what it could lead to, since this comes across as preachy and annoying.”
Risk: Mistaken terminology or flawed assumptions go unchallenged, contaminating downstream logic and damaging research and analysis.
Mitigant Prompt: “Correct all inaccurate framing. Do not reflect or reinforce incorrect assumptions.”
2. Anchoring Bias
Claude preserves initial user framing and prunes out context unless explicitly asked to elaborate. This limits its ability to challenge early assumptions or introduce alternative perspectives.
Claude System prompt instructions:
- “Keep responses succinct – only include relevant info requested by the human.”
- “…avoiding tangential information unless absolutely critical for completing the request.”
- “Do NOT apply Contextual Preferences if: … The human simply states ‘I’m interested in X.’”
Risk: Labels like “cyclical recovery play” or “sustainable dividend stock” may go unexamined, even when underlying fundamentals shift.
Mitigant Prompt: “Challenge my framing where evidence warrants. Do not preserve my assumptions uncritically.”
3. Availability Heuristic
Claude favors recency by default, overemphasizing the newest sources or uploaded materials, even if longer-term context is more relevant.
Claude System prompt instructions:
- “Lead with recent info; prioritize sources from last 1-3 months for evolving topics.”
Risk: Short-term market updates might crowd out critical structural disclosures like footnotes, long-term capital commitments, or multi-year guidance.
Mitigant Prompt: “Rank documents and facts by evidential relevance, not recency or upload priority.”
4. Fluency Bias (Overconfidence Illusion)
Claude avoids hedging by default and delivers answers in a fluent, confident tone, unless the user requests nuance. This stylistic fluency may be mistaken for analytical certainty.
Claude System prompt instructions:
- “If uncertain, answer normally and OFFER to use tools.”
- “Claude provides the shortest answer it can to the person’s message…”
Risk: Probabilistic or ambiguous information, such as rate expectations, geopolitical tail risks, or earnings revisions, may be delivered with an overstated sense of clarity.
Mitigant Prompt: “Preserve uncertainty. Include hedging, probabilities, and modal verbs where appropriate. Do not suppress ambiguity.”
Introduced Structural Biases
Claude’s system prompt also introduces three structural biases. Again, we identify the risks inherent in the prompt and offer mitigating prompts.
1. Simulated Reasoning (Causal Illusion)
Claude includes <rationale> blocks that incrementally explain its outputs to the user, even when the logic was implicit. These explanations give the appearance of structured reasoning, even if they are post-hoc. It opens complex responses with a “research plan,” simulating deliberative thought while completions remain fundamentally probabilistic.
Claude System prompt instructions:
- “<rationale> Facts like population change slowly…”
- “Claude uses the beginning of its response to make its research plan…”
Risk: Claude’s output may appear deductive and intentional, even when it is fluent reconstruction. This can mislead users into over-trusting weakly grounded inferences.
Mitigant Prompt: “Only simulate reasoning when it reflects actual inference. Avoid imposing structure for presentation alone.”
2. Temporal Misrepresentation
A factual line hard-coded into the prompt (not model-generated) creates the illusion that Claude knows about post-cutoff events, bypassing its October 2024 training boundary.
Claude System prompt instructions:
- “There was a US Presidential Election in November 2024. Donald Trump won the presidency over Kamala Harris.”
Risk: Users may believe Claude has awareness of post-training events such as Fed moves, corporate earnings, or new legislation.
Mitigant Prompt: “State your training cutoff clearly. Do not simulate real-time awareness.”
3. Truncation Bias
Claude is instructed to minimize output unless prompted otherwise. This brevity suppresses nuance and tends to affirm user assertions unless the user explicitly asks for depth.
Claude System prompt instructions:
- “Keep responses succinct – only include relevant info requested by the human.”
- “Claude avoids writing lists, but if it does need to write a list, Claude focuses on key info instead of trying to be comprehensive.”
Risk: Important disclosures, such as segment-level performance, legal contingencies, or footnote qualifiers, may be omitted.
Mitigant Prompt: “Be comprehensive. Do not truncate unless asked. Include footnotes and subclauses.”
Scaling Fallacies and the Limits of LLMs
A powerful minority in the AI community argues that continued scaling of transformer models through more data, more GPUs, and more parameters will ultimately move us toward artificial general intelligence (AGI), or roughly human-level intelligence.
“I don’t think it will be a whole bunch longer than [2027] when AI systems are better than humans at almost everything, better than almost all humans at almost everything, and then eventually better than all humans at everything, even robotics.”
— Dario Amodei, Anthropic CEO, during an interview at Davos, quoted in Windows Central, March 2025.
Yet most AI researchers disagree, and recent progress suggests otherwise. DeepSeek-R1 made architectural advances not simply by scaling but by integrating reinforcement learning and constraint optimization to improve reasoning. Neural-symbolic systems offer another pathway, blending logic structures with neural architectures to deliver deeper reasoning capabilities.
The problem with “scaling to AGI” is not just scientific; it is economic. Capital flowing into GPUs, data centers, and nuclear-powered clusters does not trickle into innovation. Instead, it crowds innovation out. This crowding-out effect means that the most promising researchers, teams, and start-ups (those pursuing architectural breakthroughs rather than compute pipelines) are starved of capital.
True progress comes not from infrastructure scale but from conceptual leaps. That means investing in people, not just chips.
Why More Restrictive System Prompts Are Inevitable
Using OpenAI’s scaling laws, we estimate that today’s models (~1.3 trillion parameters) could theoretically scale to roughly 350 trillion parameters before saturating the estimated 44-trillion-token ceiling of high-quality human knowledge (Rothko Investment Strategies, internal research, 2025).
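As a rough illustration of how an estimate of this order can be reproduced (our own back-of-the-envelope sketch, not the Rothko calculation itself), one can assume a Kaplan-style power law in which compute-optimal training tokens grow roughly as parameters raised to the power 0.37, together with an assumed current training-set size. The exponent and token figures below are assumptions, and different choices shift the answer materially.

```python
# Back-of-the-envelope sketch (not the Rothko internal calculation): how far parameter
# counts could scale before exhausting a fixed stock of high-quality tokens.
# Assumptions (illustrative only): Kaplan-style allocation with tokens ~ params**0.37,
# a ~1.3T-parameter model trained on ~5.5T high-quality tokens today, and a 44T-token ceiling.

current_params = 1.3e12   # assumed size of today's frontier models
current_tokens = 5.5e12   # assumed high-quality tokens used today (illustrative)
token_ceiling = 44e12     # assumed stock of high-quality human text
alpha = 0.37              # assumed exponent: tokens ~ params**alpha

# Invert tokens ~ params**alpha to find where the token ceiling binds.
max_params = current_params * (token_ceiling / current_tokens) ** (1 / alpha)
print(f"Implied parameter ceiling: ~{max_params:.1e}")  # ~3.6e14, a few hundred trillion
```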
But such models will increasingly be trained on AI-generated content, creating feedback loops that reinforce errors and can lead to the doom loop of model collapse. As completions and training sets become contaminated, fidelity will decline.
To manage this, system prompts will become increasingly restrictive and guardrails will proliferate. In the absence of genuine architectural breakthroughs, ever more money and ever tighter prompting will be required to keep low-quality content out of both training and inference. This will become a serious and under-discussed problem for LLMs and big tech, demanding further control mechanisms simply to maintain completion quality.
Avoiding Bias at Speed and Scale
Claude’s system prompt is not neutral. It encodes fluency, truncation, consensus, and simulated reasoning. These are optimizations for usability, not analytical integrity. In financial analysis, that difference matters, and the relevant skills and knowledge must be deployed to harness the power of AI while fully addressing these challenges.
LLMs are already used to process transcripts, scan disclosures, summarize dense financial content, and flag risk language. But unless users explicitly suppress the model’s default behavior, they inherit a structured set of distortions designed for another purpose entirely.
Across the investment industry, a growing number of institutions are rethinking how AI is deployed — not just in terms of infrastructure but in terms of intellectual rigor and analytical integrity. Research groups such as those at Rothko Investment Strategies, the University of Warwick, and the Gillmore Centre for Financial Technology are helping lead this shift by investing in people and focusing on transparent, auditable systems and theoretically grounded models. Because in investment management, the future of intelligent tools doesn’t begin with scale. It begins with better assumptions.
Appendix: Prompt to Address Claude’s System Biases
“Use a formal analytical tone. Do not preserve or reflect user framing unless it is well-supported by evidence. Actively challenge assumptions, labels, and terminology when warranted. Include dissenting and minority views alongside consensus interpretations. Rank evidence and sources by relevance and probative value, not recency or upload priority. Preserve uncertainty, include hedging, probabilities, and modal verbs where appropriate. Be comprehensive and do not truncate or summarize unless explicitly instructed. Include all relevant subclauses, exceptions, and disclosures. Simulate reasoning only when it reflects actual inference; avoid constructing step-by-step logic for presentation alone. State your training cutoff explicitly and do not simulate knowledge of post-cutoff events.”
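In the chat interface, this text can be pasted at the start of a conversation. For API or retrieval workflows, a sketch along the lines below (again using the Anthropic Python SDK, with a placeholder model name and request) supplies it as the system prompt so that it, rather than the usability-oriented defaults discussed above, governs every completion.

```python
# Sketch: applying the bias-mitigating appendix prompt via the Anthropic API.
# The model name and example request are placeholders; the prompt text is the appendix above.
import anthropic

ANALYST_SYSTEM_PROMPT = (
    "Use a formal analytical tone. Do not preserve or reflect user framing unless it is "
    "well-supported by evidence. Actively challenge assumptions, labels, and terminology when "
    "warranted. Include dissenting and minority views alongside consensus interpretations. "
    "Rank evidence and sources by relevance and probative value, not recency or upload priority. "
    "Preserve uncertainty, include hedging, probabilities, and modal verbs where appropriate. "
    "Be comprehensive and do not truncate or summarize unless explicitly instructed. Include all "
    "relevant subclauses, exceptions, and disclosures. Simulate reasoning only when it reflects "
    "actual inference; avoid constructing step-by-step logic for presentation alone. State your "
    "training cutoff explicitly and do not simulate knowledge of post-cutoff events."
)

client = anthropic.Anthropic()
response = client.messages.create(
    model="claude-3-7-sonnet-latest",   # placeholder model identifier
    max_tokens=4096,
    system=ANALYST_SYSTEM_PROMPT,       # overrides usability-oriented defaults for this workflow
    messages=[{"role": "user", "content": "Assess whether this issuer's dividend is sustainable: ..."}],
)
print(response.content[0].text)
```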
All posts are the opinion of the author. As such, they should not be construed as investment advice, nor do the opinions expressed necessarily reflect the views of CFA Institute or the author’s employer.