Back to Dashboard

Methodi Documentation

AI-powered platform for financial research—from literature review to reproducible analysis.

Quick Start

Get from paper to results in 5 minutes.

  1. 1

    Create a Project

    Click "New Project" from your dashboard. Give it a name that describes your research goal.

  2. 2

    Upload a Paper

    Go to the Research tab and upload your PDF. Methodi will parse it and index it for AI-powered search.

  3. 3

    Upload Your Data

    Go to Data Library and drag your CSV or Parquet file. Methodi will profile it automatically, detecting columns and data types.

  4. 4

    Ask the Research Assistant

    In the Research tab, ask: "What methodology does this paper use?" The AI will search the paper and explain.

  5. 5

    Generate a Notebook

    In the Research tab, ask the AI to generate a notebook. It will create a complete, runnable Marimo notebook saved to your workspace.

  6. 6

    Run and Commit

    Launch Marimo from the Execute tab, run cells with Shift+Enter, then commit to Timeline to track your progress.

Key Concepts

Projects

A project contains everything for one research question: papers, datasets, notes, notebooks, and timeline. Each project is self-contained and can be shared with team members.

The AI Pipeline

Methodi uses AI at every step:

  1. 1. Paper Parsing — AI reads your PDF with scientific precision, preserving equations and tables
  2. 2. Research Assistant — Discuss methodology, ask questions, get citations from your papers
  3. 3. Notebook Architect — Generate documented, runnable code based on your plan
  4. 4. Execution — Run notebooks and track results with AI-generated summaries

Timeline

Like git for research. Every notebook you complete becomes a "commit" showing what data went in, what came out, and what decisions were made. This creates a traceable record of your methodology.

Workspace

Your project's file system. Notes, notebooks, and outputs all live here. You can organize files into folders and access them from anywhere.

Project Context (project.md)

Methodi uses a special project.md file at your workspace root to maintain project context across all conversations.

What is project.md?

project.md is the master project document that captures:

  • Research goal: What methodology you're replicating and why
  • Sample definition: Date ranges, filters, and universe selection
  • Key variables: How they're calculated and any adaptations made
  • Decisions made: Methodology choices and their justifications
  • Current status: What's been done and what's next

How It Works

  • 1Automatic injection: project.md is automatically loaded into every Research Agent conversation as context
  • 2Recent progress: Your last 3 notebook commits are also included, so the AI knows what you've accomplished
  • 3AI updates: Ask the Research Agent to update project.md when major decisions are made

Updating project.md

In the Research Agent, simply ask:

"Update project.md with our decision to use quarterly MAX instead of 5-day MAX"

The AI will add the decision to the appropriate section and preserve existing content.

Why This Matters

Without project.md, every conversation starts from scratch. With it, the AI remembers your research goal, what data you have, and what decisions you've made—so you can pick up right where you left off.

Research Agent

The Research Agent is a unified AI assistant that handles methodology discussion, data exploration, and notebook generation—all with full access to your project context.

What Context Does the Agent Have?

Every conversation with the Research Agent includes:

  • project.md

    Your research goals, sample definitions, variables, and decisions—automatically loaded

  • Recent Progress

    Your last 3 notebook commits with AI summaries—so it knows what you've accomplished

  • Dataset Schemas

    Column names, types, statistics, and your comments—accessed via tools

  • Paper Content

    Full-text search across your uploaded PDFs with equation and table extraction

  • Workspace Files

    Notes, notebooks, and outputs—accessible via read/write tools

What the Research Agent Can Do

  • Search papers: Semantically search your uploaded PDFs for methodology, definitions, and citations
  • Read your data: Access your workspace files and dataset schemas, including comments and tags you've added
  • Web search: Find relevant papers, documentation, and resources online
  • Save findings: Write markdown notes to preserve discussion outcomes and methodology decisions
  • Control notebooks: Read, edit, and execute cells in your running Marimo session—debug errors and iterate on code without leaving the chat

Uploading Papers

  1. 1. Go to the Research tab in your project
  2. 2. Click "Upload PDF" in the right sidebar
  3. 3. Select one or more PDF files
  4. 4. Wait for processing (status: pending → processing → ready)

Tip: Use publisher PDFs rather than scans for best results. The AI extracts equations, tables, and methodology sections.

Example Questions

  • "How do they define the MAX variable in Bali et al.?"
  • "What columns does my returns dataset have?"
  • "Search for recent papers on momentum crash risk"
  • "Save a summary of the methodology we discussed to notes"

The AI cites specific sections from your papers, making it easy to verify information.

Example Conversation

You:

"I want to study MAX but I only have monthly data. What are my options?"

AI:

"Looking at your dataset in data/returns_monthly.parquet... you have monthly returns from 2010-2023. For MAX without daily data, you could use Quarterly MAX (highest monthly return per quarter). Let me search for papers that validate this approach... Chen et al. (2019) found similar predictive power. Should I save these findings to notes? Or I can generate a notebook implementing this approach."

Execute

Launch Marimo in the cloud and run your notebooks. The Research Agent can debug and modify notebooks directly—no API key required.

Running Notebooks in the Cloud

  1. 1. Click Launch in the Execute tab to start Marimo
  2. 2. Open your notebook from the /workspace/notebooks/ folder
  3. 3. Run cells with Shift+Enter or click the play button
  4. 4. Save your notebook (Cmd/Ctrl+S) — files save directly to your workspace
  5. 5. Commit to Timeline to track your progress

AI-Powered Debugging (Recommended)

When you encounter an error, ask the Research Agent to help. It can read your notebook, see the error, fix the code, and re-run the cell—all from the Research tab.

You: "I got a KeyError in cell 5, can you fix it?"

Research Agent: "I see the error—the column is 'permno' not 'PERMNO'. I've fixed cell 5 and re-executed it. Your Marimo notebook will automatically update."

Why this is better: The Research Agent has full project context—your papers, datasets, and methodology decisions. It understands why you're doing the analysis, not just what the code does.

Timeline & Commits

The Timeline tracks your research progress like git tracks code.

What is a Commit?

A commit represents a completed notebook and records:

  • Inputs: What datasets were used
  • Outputs: What datasets were created
  • Message: What you did (your description or AI-generated)
  • Notebook: Link to the executed notebook

Creating a Commit

  1. 1. Go to Timeline in your project
  2. 2. Click "New Commit"
  3. 3. Select the notebook you completed
  4. 4. Confirm or edit the detected inputs/outputs
  5. 5. Add a commit message (or let AI generate one)
  6. 6. Click "Create Commit"

Why This Matters

  • Onboarding: New team members can understand months of work in minutes
  • Reproducibility: Trace any result back to its source
  • Documentation: AI summarizes every step automatically

Data Library

Upload, organize, and analyze your datasets with automatic profiling.

Uploading Data

Drag files into the Data Library or click "Upload".

What happens when you upload:

  1. 1. File is uploaded to secure cloud storage
  2. 2. Converted to Parquet format for efficient processing
  3. 3. Columns and types are detected automatically
  4. 4. Sample values are extracted for preview

Data Profiling

Click "Generate Insights" on any dataset to get:

  • Column statistics (min, max, mean, percentiles)
  • Panel structure analysis (entities, time periods, balance)
  • Data quality indicators (missing values, outliers)
  • Automatic ID detection (PERMNO, CUSIP, GVKEY, etc.)

Organization

  • Folders: Organize datasets into folders (e.g., "raw", "processed")
  • Tags: Classify datasets with custom tags for filtering
  • Comments: Add notes about data sources or transformations

Workspace

Your project's file browser. Manage notebooks, notes, data files, and outputs.

Default Folders

Every project starts with three key folders:

  • notes/Markdown files for research notes, methodology discussions, and documentation
  • notebooks/Marimo notebooks (.py files) generated by the Research Agent and your executed analyses
  • data/Intermediate datasets, outputs, and processed files from your notebooks

You can create additional folders to organize your work however you prefer.

What You Can Do

  • Browse files: Navigate folders in the left panel
  • Preview: Click a file to see its contents in the right panel
  • Upload: Drag files or click the upload button
  • Download: Click the download icon on any file
  • Create: Make new markdown notes or folders

For SAS/STATA Users

AI-First Workflow

Whether you're new to Python or an experienced programmer, Methodi accelerates your work by generating code from your descriptions. Your workflow is simple:

  1. 1. Describe your analysis in plain language
  2. 2. Run the generated code (Shift+Enter)
  3. 3. Verify the output matches your expectations

Key Differences

SAS/STATAMarimo
Run entire programRun one cell at a time (dependent cells update automatically)
Log windowOutput appears inline with each cell
Separate data/codeData and code together in notebook
Licensed softwarePython is free and open source

Reading Python Output

DataFrames (like SAS datasets)

      permno       date       ret
0      10001 2010-01-04  0.023451
1      10001 2010-01-05 -0.012345
2      10001 2010-01-06  0.008901

This is like PROC PRINT output.

Summary Statistics

count    524382.000000
mean          0.031245
std           0.042156
min          -0.089234
50%           0.024567
max           0.234567

This is like PROC MEANS output.

Common Translations

SASSTATAPython (pandas)
data work.new;gen newvar = ...df['newvar'] = ...
proc sortsortdf.sort_values()
mergemergepd.merge()
proc meanssummarizedf.describe()
proc regregresssm.OLS()

This table helps you read AI-generated code. You can also right-click any code cell and select "Explain" for a detailed breakdown.

Getting Help with Code

Use the Research Agent in the Research tab for AI assistance:

  • "Fix cell 3" — The agent reads the error and fixes the code directly
  • "Explain this code" — Get a plain-language explanation of what the code does
  • "Improve this approach" — Suggest optimizations based on your methodology

The Research Agent understands your project context—papers, datasets, and decisions—so it can give better suggestions.

Running Notebooks in Marimo

Methodi runs Marimo in the cloud—no local Python installation required. Your notebooks execute on our servers with all standard data science packages pre-installed.

Cloud Sandbox Environment

When you launch Marimo from the Execute tab, we spin up a cloud sandbox with:

  • Pre-installed packages: pandas, numpy, scipy, statsmodels, matplotlib, seaborn, scikit-learn, pyarrow, plotly
  • Your datasets: Mounted at /data/
  • Persistent workspace: Notebooks and outputs saved at /workspace/
  • AI debugging: Research Agent can edit and execute cells from the Research tab (no API key needed)
  • Reactive execution: Marimo automatically re-runs dependent cells when you make changes

Sessions timeout after 2 hours of inactivity. Your files persist across sessions.

Why Marimo?

Marimo is a next-generation Python notebook that fixes common pain points:

  • No hidden state: What you see is what you get—no stale variables from deleted cells
  • Pure Python files: Notebooks are stored as .py files, making them easy to version control and diff
  • Built-in UI elements: Interactive sliders, dropdowns, and tables without extra libraries
  • Reproducible by design: Run cells in any order and get the same result

The Most Important Skill: Shift+Enter

This runs the current cell. Marimo will automatically re-run any cells that depend on it.

ShortcutAction
Shift + EnterRun cell
Ctrl + EnterRun cell, stay focused
Ctrl + SSave notebook

The Basic Workflow

  1. 1. Read the markdown cell (understand what this step does)
  2. 2. Run the code cell (Shift+Enter or click the play button)
  3. 3. Check the output (Marimo shows results inline)
  4. 4. Continue to the next step—dependent cells update automatically

Getting Help

Use the Research Agent in the Research tab for AI assistance:

  • Fix errors: "I got an error in cell 3, can you fix it?"
  • Explain code: "What does this pandas operation do?"
  • Edit cells: "Change cell 5 to use a log transformation instead"

The Research Agent has full project context—your papers, datasets, and methodology decisions. It can edit cells directly in your notebook.

If a Cell Gets Stuck

  1. 1. Click the Stop button in the toolbar (⏹️)
  2. 2. If needed, use the Interrupt Kernel option from the menu
  3. 3. Edit your code and try again—Marimo will only re-run affected cells

How Billing Works

Methodi uses a simple pay-as-you-go model. You only pay for what you use.

Free Credits

  • $10.00 starting balance when you sign up
  • $10.00 monthly refresh every 30 days
  • No credit card required to start

Project-Based Billing

All usage within a project is billed to the project owner. This enables seamless team collaboration:

  • Team members work without needing their own balance
  • Centralized billing for easy expense tracking
  • Project owners can monitor usage in Settings → Billing

What Counts as Usage?

  • AI conversations: Research Agent chats and notebook generation
  • Paper parsing: Converting PDFs to searchable content
  • Notebook execution: Running JupyterLab in the cloud

Adding Funds

When your balance runs low, you can top up from Settings → Billing:

  • Minimum top-up: $10.00
  • Presets: $10, $25, $50, $100
  • Auto-recharge: Automatically add funds when balance drops below $2.00

Note: A minimum balance of $0.50 is required to start any billable action. This prevents interrupted operations.

Pricing

Transparent, pay-as-you-go pricing. No hidden fees.

ServicePriceNotes
AI Conversations
$0.015 / 1K input tokens
$0.075 / 1K output tokens
Research, Plan, Execute agents
Paper Parsing$0.10 / pageScientific PDF extraction
Marimo Sandbox$0.30 / hourCloud compute for notebooks
StorageFreeUp to 100 GB per account

Free Tier

Every account gets $10.00 starting balance plus $10.00 monthly refresh. Most users never need to add funds.

Typical Usage Examples

  • $0.05A typical AI conversation (research question + response)
  • $1.00Parsing a 10-page research paper
  • $0.502 hours of Marimo notebook execution
  • $0.20Generating a complete methodology notebook

Supported File Formats

TypeFormatsNotes
PapersPDFUse publisher PDFs for best parsing
DataCSV, TSV, Excel (.xlsx, .xls)Converted to Parquet for processing
Data (Preferred)Parquet10x faster than CSV for large files
NotesMarkdown (.md)Create and edit in Workspace
NotebooksMarimo (.py)Generated by Research Agent or created manually

Tip: For large financial datasets (CRSP, Compustat), convert to Parquet before uploading. It's much faster to process.

Frequently Asked Questions

Do I need to know Python?

Not required. AI generates the code, so you can focus on your research. That said, Python experience helps you customize and extend what AI creates.

What if the AI-generated code has an error?

Ask the Research Agent to fix it: "I got an error in cell 5, can you fix it?" The agent can read your notebook, see the error, edit the cell, and re-execute it—all from the Research tab. No API key required.

Can I edit the generated code?

Yes! The code is fully editable. Modify it however you like. If you break something, regenerate the notebook.

What's the largest file I can upload?

5GB per file. For very large datasets, use Parquet format—it's much faster to process than CSV.

Can I use R instead of Python?

Not currently. Methodi generates Python code using pandas and statsmodels.

How do I share results with someone outside my team?

Export notebooks as PDF or HTML from Marimo's export options, or share the .py file directly since Marimo notebooks are pure Python.

External Resources

Learn more about Marimo and Python from these excellent resources.