Methodi Documentation

AI-powered platform for financial research—from literature review to reproducible analysis.

Quick Start

Get from paper to results in 5 minutes.

1
Create a Project
Click "New Project" from your dashboard. Give it a name that describes your research goal.
2
Upload a Paper
Go to the Research tab and upload your PDF. Methodi will parse it and index it for AI-powered search.
3
Upload Your Data
Go to Data Library and drag your CSV or Parquet file. Methodi will profile it automatically, detecting columns and data types.
4
Ask the Research Assistant
In the Research tab, ask: "What methodology does this paper use?" The AI will search the paper and explain.
5
Generate a Notebook
In the Research tab, ask the AI to generate a notebook. It will create a complete, runnable Marimo notebook saved to your workspace.
6
Run and Commit
Launch Marimo from the Execute tab, run cells with Shift+Enter, then commit to Timeline to track your progress.

Key Concepts

Projects

A project contains everything for one research question: papers, datasets, notes, notebooks, and timeline. Each project is self-contained and can be shared with team members.

The AI Pipeline

Methodi uses AI at every step:

1. Paper Parsing — AI reads your PDF with scientific precision, preserving equations and tables
2. Research Assistant — Discuss methodology, ask questions, get citations from your papers
3. Notebook Architect — Generate documented, runnable code based on your plan
4. Execution — Run notebooks and track results with AI-generated summaries

Timeline

Like git for research. Every notebook you complete becomes a "commit" showing what data went in, what came out, and what decisions were made. This creates a traceable record of your methodology.

Workspace

Your project's file system. Notes, notebooks, and outputs all live here. You can organize files into folders and access them from anywhere.

Project Context (project.md)

Methodi uses a special project.md file at your workspace root to maintain project context across all conversations.

What is project.md?

project.md is the master project document that captures:

Research goal: What methodology you're replicating and why
Sample definition: Date ranges, filters, and universe selection
Key variables: How they're calculated and any adaptations made
Decisions made: Methodology choices and their justifications
Current status: What's been done and what's next

How It Works

1Automatic injection: project.md is automatically loaded into every Research Agent conversation as context
2Recent progress: Your last 3 notebook commits are also included, so the AI knows what you've accomplished
3AI updates: Ask the Research Agent to update project.md when major decisions are made

Updating project.md

In the Research Agent, simply ask:

"Update project.md with our decision to use quarterly MAX instead of 5-day MAX"

The AI will add the decision to the appropriate section and preserve existing content.

Why This Matters

Without project.md, every conversation starts from scratch. With it, the AI remembers your research goal, what data you have, and what decisions you've made—so you can pick up right where you left off.

Research Agent

The Research Agent is a unified AI assistant that handles methodology discussion, data exploration, and notebook generation—all with full access to your project context.

What Context Does the Agent Have?

Every conversation with the Research Agent includes:

project.md
Your research goals, sample definitions, variables, and decisions—automatically loaded
Recent Progress
Your last 3 notebook commits with AI summaries—so it knows what you've accomplished
Dataset Schemas
Column names, types, statistics, and your comments—accessed via tools
Paper Content
Full-text search across your uploaded PDFs with equation and table extraction
Workspace Files
Notes, notebooks, and outputs—accessible via read/write tools

What the Research Agent Can Do

Search papers: Semantically search your uploaded PDFs for methodology, definitions, and citations
Read your data: Access your workspace files and dataset schemas, including comments and tags you've added
Web search: Find relevant papers, documentation, and resources online
Save findings: Write markdown notes to preserve discussion outcomes and methodology decisions
Control notebooks: Read, edit, and execute cells in your running Marimo session—debug errors and iterate on code without leaving the chat

Uploading Papers

1. Go to the Research tab in your project
2. Click "Upload PDF" in the right sidebar
3. Select one or more PDF files
4. Wait for processing (status: pending → processing → ready)

Tip: Use publisher PDFs rather than scans for best results. The AI extracts equations, tables, and methodology sections.

Example Questions

"How do they define the MAX variable in Bali et al.?"
"What columns does my returns dataset have?"
"Search for recent papers on momentum crash risk"
"Save a summary of the methodology we discussed to notes"

The AI cites specific sections from your papers, making it easy to verify information.

Example Conversation

You:

"I want to study MAX but I only have monthly data. What are my options?"

AI:

"Looking at your dataset in data/returns_monthly.parquet... you have monthly returns from 2010-2023. For MAX without daily data, you could use Quarterly MAX (highest monthly return per quarter). Let me search for papers that validate this approach... Chen et al. (2019) found similar predictive power. Should I save these findings to notes? Or I can generate a notebook implementing this approach."

Execute

Launch Marimo in the cloud and run your notebooks. The Research Agent can debug and modify notebooks directly—no API key required.

Running Notebooks in the Cloud

1. Click Launch in the Execute tab to start Marimo
2. Open your notebook from the /workspace/notebooks/ folder
3. Run cells with Shift+Enter or click the play button
4. Save your notebook (Cmd/Ctrl+S) — files save directly to your workspace
5. Commit to Timeline to track your progress

AI-Powered Debugging (Recommended)

When you encounter an error, ask the Research Agent to help. It can read your notebook, see the error, fix the code, and re-run the cell—all from the Research tab.

You: "I got a KeyError in cell 5, can you fix it?"

Research Agent: "I see the error—the column is 'permno' not 'PERMNO'. I've fixed cell 5 and re-executed it. Your Marimo notebook will automatically update."

Why this is better: The Research Agent has full project context—your papers, datasets, and methodology decisions. It understands why you're doing the analysis, not just what the code does.

Timeline & Commits

The Timeline tracks your research progress like git tracks code.

What is a Commit?

A commit represents a completed notebook and records:

Inputs: What datasets were used
Outputs: What datasets were created
Message: What you did (your description or AI-generated)
Notebook: Link to the executed notebook

Creating a Commit

1. Go to Timeline in your project
2. Click "New Commit"
3. Select the notebook you completed
4. Confirm or edit the detected inputs/outputs
5. Add a commit message (or let AI generate one)
6. Click "Create Commit"

Why This Matters

Onboarding: New team members can understand months of work in minutes
Reproducibility: Trace any result back to its source
Documentation: AI summarizes every step automatically

Data Library

Upload, organize, and analyze your datasets with automatic profiling.

Uploading Data

Drag files into the Data Library or click "Upload".

What happens when you upload:

1. File is uploaded to secure cloud storage
2. Converted to Parquet format for efficient processing
3. Columns and types are detected automatically
4. Sample values are extracted for preview

Data Profiling

Click "Generate Insights" on any dataset to get:

Column statistics (min, max, mean, percentiles)
Panel structure analysis (entities, time periods, balance)
Data quality indicators (missing values, outliers)
Automatic ID detection (PERMNO, CUSIP, GVKEY, etc.)

Organization

Folders: Organize datasets into folders (e.g., "raw", "processed")
Tags: Classify datasets with custom tags for filtering
Comments: Add notes about data sources or transformations

Workspace

Your project's file browser. Manage notebooks, notes, data files, and outputs.

Default Folders

Every project starts with three key folders:

notes/Markdown files for research notes, methodology discussions, and documentation
notebooks/Marimo notebooks (.py files) generated by the Research Agent and your executed analyses
data/Intermediate datasets, outputs, and processed files from your notebooks

You can create additional folders to organize your work however you prefer.

What You Can Do

Browse files: Navigate folders in the left panel
Preview: Click a file to see its contents in the right panel
Upload: Drag files or click the upload button
Download: Click the download icon on any file
Create: Make new markdown notes or folders

For SAS/STATA Users

AI-First Workflow

Whether you're new to Python or an experienced programmer, Methodi accelerates your work by generating code from your descriptions. Your workflow is simple:

1. Describe your analysis in plain language
2. Run the generated code (Shift+Enter)
3. Verify the output matches your expectations

Key Differences

SAS/STATA	Marimo
Run entire program	Run one cell at a time (dependent cells update automatically)
Log window	Output appears inline with each cell
Separate data/code	Data and code together in notebook
Licensed software	Python is free and open source

Reading Python Output

DataFrames (like SAS datasets)

      permno       date       ret
0      10001 2010-01-04  0.023451
1      10001 2010-01-05 -0.012345
2      10001 2010-01-06  0.008901

This is like PROC PRINT output.

Summary Statistics

count    524382.000000
mean          0.031245
std           0.042156
min          -0.089234
50%           0.024567
max           0.234567

This is like PROC MEANS output.

Common Translations

SAS	STATA	Python (pandas)
data work.new;	gen newvar = ...	df['newvar'] = ...
proc sort	sort	df.sort_values()
merge	merge	pd.merge()
proc means	summarize	df.describe()
proc reg	regress	sm.OLS()

This table helps you read AI-generated code. You can also right-click any code cell and select "Explain" for a detailed breakdown.

Getting Help with Code

Use the Research Agent in the Research tab for AI assistance:

"Fix cell 3" — The agent reads the error and fixes the code directly
"Explain this code" — Get a plain-language explanation of what the code does
"Improve this approach" — Suggest optimizations based on your methodology

The Research Agent understands your project context—papers, datasets, and decisions—so it can give better suggestions.

Running Notebooks in Marimo

Methodi runs Marimo in the cloud—no local Python installation required. Your notebooks execute on our servers with all standard data science packages pre-installed.

Cloud Sandbox Environment

When you launch Marimo from the Execute tab, we spin up a cloud sandbox with:

Pre-installed packages: pandas, numpy, scipy, statsmodels, matplotlib, seaborn, scikit-learn, pyarrow, plotly
Your datasets: Mounted at /data/
Persistent workspace: Notebooks and outputs saved at /workspace/
AI debugging: Research Agent can edit and execute cells from the Research tab (no API key needed)
Reactive execution: Marimo automatically re-runs dependent cells when you make changes

Sessions timeout after 2 hours of inactivity. Your files persist across sessions.

Why Marimo?

Marimo is a next-generation Python notebook that fixes common pain points:

No hidden state: What you see is what you get—no stale variables from deleted cells
Pure Python files: Notebooks are stored as .py files, making them easy to version control and diff
Built-in UI elements: Interactive sliders, dropdowns, and tables without extra libraries
Reproducible by design: Run cells in any order and get the same result

The Most Important Skill: Shift+Enter

This runs the current cell. Marimo will automatically re-run any cells that depend on it.

Shortcut	Action
Shift + Enter	Run cell
Ctrl + Enter	Run cell, stay focused
Ctrl + S	Save notebook

The Basic Workflow

1. Read the markdown cell (understand what this step does)
2. Run the code cell (Shift+Enter or click the play button)
3. Check the output (Marimo shows results inline)
4. Continue to the next step—dependent cells update automatically

Getting Help

Use the Research Agent in the Research tab for AI assistance:

Fix errors: "I got an error in cell 3, can you fix it?"
Explain code: "What does this pandas operation do?"
Edit cells: "Change cell 5 to use a log transformation instead"

The Research Agent has full project context—your papers, datasets, and methodology decisions. It can edit cells directly in your notebook.

If a Cell Gets Stuck

1. Click the Stop button in the toolbar (⏹️)
2. If needed, use the Interrupt Kernel option from the menu
3. Edit your code and try again—Marimo will only re-run affected cells

How Billing Works

Methodi uses a simple pay-as-you-go model. You only pay for what you use.

Free Credits

$10.00 starting balance when you sign up
$10.00 monthly refresh every 30 days
No credit card required to start

Project-Based Billing

All usage within a project is billed to the project owner. This enables seamless team collaboration:

Team members work without needing their own balance
Centralized billing for easy expense tracking
Project owners can monitor usage in Settings → Billing

What Counts as Usage?

AI conversations: Research Agent chats and notebook generation
Paper parsing: Converting PDFs to searchable content
Notebook execution: Running JupyterLab in the cloud

Adding Funds

When your balance runs low, you can top up from Settings → Billing:

Minimum top-up: $10.00
Presets: $10, $25, $50, $100
Auto-recharge: Automatically add funds when balance drops below $2.00

Note: A minimum balance of $0.50 is required to start any billable action. This prevents interrupted operations.

Pricing

Transparent, pay-as-you-go pricing. No hidden fees.

Service	Price	Notes
AI Conversations	$0.015 / 1K input tokens $0.075 / 1K output tokens	Research, Plan, Execute agents
Paper Parsing	$0.10 / page	Scientific PDF extraction
Marimo Sandbox	$0.30 / hour	Cloud compute for notebooks
Storage	Free	Up to 100 GB per account

Free Tier

Every account gets $10.00 starting balance plus $10.00 monthly refresh. Most users never need to add funds.

Typical Usage Examples

$0.05A typical AI conversation (research question + response)
$1.00Parsing a 10-page research paper
$0.502 hours of Marimo notebook execution
$0.20Generating a complete methodology notebook

Supported File Formats

Type	Formats	Notes
Papers	PDF	Use publisher PDFs for best parsing
Data	CSV, TSV, Excel (.xlsx, .xls)	Converted to Parquet for processing
Data (Preferred)	Parquet	10x faster than CSV for large files
Notes	Markdown (.md)	Create and edit in Workspace
Notebooks	Marimo (.py)	Generated by Research Agent or created manually

Tip: For large financial datasets (CRSP, Compustat), convert to Parquet before uploading. It's much faster to process.

Frequently Asked Questions

Do I need to know Python?

Not required. AI generates the code, so you can focus on your research. That said, Python experience helps you customize and extend what AI creates.

What if the AI-generated code has an error?

Ask the Research Agent to fix it: "I got an error in cell 5, can you fix it?" The agent can read your notebook, see the error, edit the cell, and re-execute it—all from the Research tab. No API key required.

Can I edit the generated code?

Yes! The code is fully editable. Modify it however you like. If you break something, regenerate the notebook.

What's the largest file I can upload?

5GB per file. For very large datasets, use Parquet format—it's much faster to process than CSV.

Can I use R instead of Python?

Not currently. Methodi generates Python code using pandas and statsmodels.

How do I share results with someone outside my team?

Export notebooks as PDF or HTML from Marimo's export options, or share the .py file directly since Marimo notebooks are pure Python.

Methodi Documentation

Quick Start

Create a Project

Upload a Paper

Upload Your Data

Ask the Research Assistant

Generate a Notebook

Run and Commit

Key Concepts

Projects

The AI Pipeline

Timeline

Workspace

Project Context (project.md)

What is project.md?

How It Works

Updating project.md

Why This Matters

Research Agent

What Context Does the Agent Have?

What the Research Agent Can Do

Uploading Papers

Example Questions

Example Conversation

Execute

Running Notebooks in the Cloud

AI-Powered Debugging (Recommended)

Timeline & Commits

What is a Commit?

Creating a Commit

Why This Matters

Data Library

Uploading Data

Data Profiling

Organization

Workspace

Default Folders

What You Can Do

For SAS/STATA Users

AI-First Workflow

Key Differences

Reading Python Output

DataFrames (like SAS datasets)

Summary Statistics

Common Translations

Getting Help with Code

Running Notebooks in Marimo

Cloud Sandbox Environment

Why Marimo?

The Most Important Skill: Shift+Enter

The Basic Workflow

Getting Help

If a Cell Gets Stuck

How Billing Works

Free Credits

Project-Based Billing

What Counts as Usage?

Adding Funds

Pricing

Free Tier

Typical Usage Examples

Supported File Formats

Frequently Asked Questions

Do I need to know Python?

What if the AI-generated code has an error?

Can I edit the generated code?

What's the largest file I can upload?

Can I use R instead of Python?

How do I share results with someone outside my team?

External Resources

Marimo Documentation

Python for Finance (Optional)