I Installed the "AI Hedge Fund" With 56K GitHub Stars. Here's What Happened.

Disclaimer: This is not financial advice. I am not a financial advisor. Everything here is for educational and entertainment purposes only. I have no positions in any of the stocks mentioned. Do your own research.

It started with a GitHub notification. One of those repos that somehow crosses your feed at the right time — 56,000 stars, updated four days ago, a one-person project called “AI Hedge Fund” by a developer named virattt.

The premise is audacious: 19 AI agents, each modeled after a legendary investor, voting on whether to buy or sell a stock. Warren Buffett agents. Nassim Taleb agents. Cathie Wood agents. Michael Burry. Bill Ackman. Charlie Munger. Peter Lynch. Stanley Druckenmiller. Plus a roster of functional analysts — technical, fundamental, sentiment, valuation — feeding data into the decision.

And then there’s the disclaimer, front and center: “The system does not actually make any trades.”

I installed it anyway. Here’s what I found.

The Setup

The project lives at github.com/virattt/ai-hedge-fund. Clone it, set up a .env file, and you’re off.

Two API keys are recommended: one for the LLM (OpenAI, Anthropic, Groq, or DeepSeek) and one for financial data. The financial data key costs money — roughly $20-50/month depending on how much you query. I used Groq for the LLM (free tier) and yfinance as a fallback for data.

git clone https://github.com/virattt/ai-hedge-fund.git
cd ai-hedge-fund
poetry install
cp .env.example .env
# Edit .env with your API keys
poetry run python src/main.py --ticker AAPL,MSFT,NVDA

The project uses LangGraph for orchestrating the agent workflow. Each investor agent is a node in the graph. The graph state carries analyst signals forward until a Portfolio Manager node makes the final call.

What Each Agent Actually Does

The code is revealing. Each investor agent follows the same pattern: fetch financial data, run quantitative analysis, stuff everything into a compact prompt that says “You are Warren Buffett,” ask for JSON. Here’s what the Warren Buffett agent’s DCF calculation actually does:

# Three-stage DCF model
stage1_growth = min(conservative_growth, 0.08)  # Stage 1: cap at 8%
stage2_growth = min(conservative_growth * 0.5, 0.04)  # Stage 2: cap at 4%
terminal_growth = 0.025  # Long-term GDP growth rate
discount_rate = 0.10  # Conservative 10%
intrinsic_value = stage1_pv + stage2_pv + terminal_pv
# Apply 15% additional margin of safety
conservative_intrinsic_value = intrinsic_value * 0.85

The margin of safety calculation — a Buffett signature — is baked in. That’s actually thoughtful. For the Taleb agent, the logic flips: it uses a “barbell strategy” — either extremely bullish or extremely bearish, avoiding anything in between.

The Output

Running against AAPL, MSFT, and NVDA took about 8 minutes. Here’s roughly what came out:

Warren Buffett: “bullish, confidence 72%. Wonderful business, strong moat, fair price.”
Nassim Taleb: “bearish, confidence 61%. Fragile structure, tail risk from iPhone cycle saturation.”
Cathie Wood: “bullish, confidence 68%. AI-driven disruption of healthcare and finance will multiply revenue.”
Michael Burry: “bearish, confidence 55%. Overvalued relative to intrinsic value.”
Portfolio Manager: “hold. Mixed signals — no consensus.”

The interesting part: Taleb and Burry were both bearish. Buffett and Cathie Wood were both bullish. Two completely different worldviews arriving at opposite conclusions from the same data.

The Real Cost

Here’s what the repo doesn’t tell you in the README:

LLM calls: Each of 19 investor agents makes at least one LLM call. That’s 19 API calls per ticker. For 3 tickers: 57 calls. At Groq rates this took 8 minutes. At OpenAI GPT-4o prices: roughly $0.15-0.30 per run.
Financial data: yfinance is free but unreliable for fundamentals. The paid API key runs $20-200/month.
Time: The LangGraph orchestration adds overhead. A cold run is 5-10 minutes.

The backtesting engine in src/backtesting/ is sophisticated — it simulates portfolio allocation, margin requirements, and computes real risk metrics. But backtesting requires historical data at scale, which means more API calls, more money.

What Surprised Me

The personas mostly don’t matter. The actual investment logic is in the quantitative analysis — ROE thresholds, debt ratios, margin of safety calculations. The “You are Warren Buffett” prompt is window dressing. The LLM mostly reads the numbers and applies the threshold rules.

The Portfolio Manager is the actual intelligence. The aggregator that takes all the signals and makes a final decision is where the real logic lives. Every agent gets equal say regardless of track record or confidence level.

Nobody’s running this with real money. The disclaimer is real and honest. The code doesn’t connect to any brokerage. Even if you wanted to, you’d have to build the execution layer yourself.

The code quality is genuinely good. 742 commits from a single developer. LangGraph used correctly. Clean separation of concerns. This isn’t a weekend hack; it’s a serious project.

The 19 Agents (and what they actually check)

Agent	Persona	What it checks
Aswath Damodaran	Valuation Dean	Intrinsic value via DCF
Ben Graham	Father of Value	Margin of safety, P/E, book value
Bill Ackman	Activist Investor	Catalysts, management quality
Cathie Wood	Growth Queen	Revenue growth, disruption potential
Charlie Munger	Rational Thinker	Business quality, rationality
Michael Burry	Big Short	Overvaluation, short candidates
Mohnish Pabrai	Dhandho Investor	Low-risk, high-upside asymmetry
Nassim Taleb	Black Swan	Tail risk, antifragility
Peter Lynch	10-Bagger Hunter	PEG ratio, understandable business
Phil Fisher	Scuttlebutt	Management, product innovation
Stanley Druckenmiller	Macro Legend	Macroeconomic trends
Warren Buffett	Oracle of Omaha	Moat, ROE, intrinsic value
Technical Analyst	Charts	MACD, RSI, moving averages
Fundamentals Analyst	Financials	Revenue, earnings, cash flow
Growth Analyst	Growth	Revenue growth trends
News Sentiment	Headlines	News impact scoring
Sentiment Analyst	Market Mood	Social and news sentiment
Valuation Analyst	Pricing	DCF, relative valuation

Plus: Risk Manager, Portfolio Manager.

Should You Run It?

If you’re curious about multi-agent systems: yes. Clone it, read the code, play with the ticker flags. It’s a well-built example of how to compose multiple specialized agents into a single pipeline.

If you’re expecting investment alpha: no. The personas add narrative fun but no actual predictive power. The quantitative screens are reasonable heuristics, but they don’t adapt to regime changes, liquidity constraints, or black swan events.

If you’re building something like this yourself: study the architecture. The LangGraph state management, the way data flows from analysts through risk management to the portfolio manager, the backtesting harness — these are all cleanly implemented. It’s a better learning resource than most paid courses.

The disclaimer says it all: “This project is for educational and research purposes only.” That’s exactly what it is.