Methodi Documentation
AI-powered platform for financial research—from literature review to reproducible analysis.
Quick Start
Get from paper to results in 5 minutes.
- 1
Create a Project
Click "New Project" from your dashboard. Give it a name that describes your research goal.
- 2
Upload a Paper
Go to the Research tab and upload your PDF. Methodi will parse it and index it for AI-powered search.
- 3
Upload Your Data
Go to Data Library and drag your CSV or Parquet file. Methodi will profile it automatically, detecting columns and data types.
- 4
Ask the Research Assistant
In the Research tab, ask: "What methodology does this paper use?" The AI will search the paper and explain.
- 5
Generate a Notebook
In the Research tab, ask the AI to generate a notebook. It will create a complete, runnable Marimo notebook saved to your workspace.
- 6
Run and Commit
Launch Marimo from the Execute tab, run cells with
Shift+Enter, then commit to Timeline to track your progress.
Key Concepts
Projects
A project contains everything for one research question: papers, datasets, notes, notebooks, and timeline. Each project is self-contained and can be shared with team members.
The AI Pipeline
Methodi uses AI at every step:
- 1. Paper Parsing — AI reads your PDF with scientific precision, preserving equations and tables
- 2. Research Assistant — Discuss methodology, ask questions, get citations from your papers
- 3. Notebook Architect — Generate documented, runnable code based on your plan
- 4. Execution — Run notebooks and track results with AI-generated summaries
Timeline
Like git for research. Every notebook you complete becomes a "commit" showing what data went in, what came out, and what decisions were made. This creates a traceable record of your methodology.
Workspace
Your project's file system. Notes, notebooks, and outputs all live here. You can organize files into folders and access them from anywhere.
Project Context (project.md)
Methodi uses a special project.md file at your workspace root to maintain project context across all conversations.
What is project.md?
project.md is the master project document that captures:
- Research goal: What methodology you're replicating and why
- Sample definition: Date ranges, filters, and universe selection
- Key variables: How they're calculated and any adaptations made
- Decisions made: Methodology choices and their justifications
- Current status: What's been done and what's next
How It Works
- 1Automatic injection: project.md is automatically loaded into every Research Agent conversation as context
- 2Recent progress: Your last 3 notebook commits are also included, so the AI knows what you've accomplished
- 3AI updates: Ask the Research Agent to update project.md when major decisions are made
Updating project.md
In the Research Agent, simply ask:
"Update project.md with our decision to use quarterly MAX instead of 5-day MAX"
The AI will add the decision to the appropriate section and preserve existing content.
Why This Matters
Without project.md, every conversation starts from scratch. With it, the AI remembers your research goal, what data you have, and what decisions you've made—so you can pick up right where you left off.
Research Agent
The Research Agent is a unified AI assistant that handles methodology discussion, data exploration, and notebook generation—all with full access to your project context.
What Context Does the Agent Have?
Every conversation with the Research Agent includes:
project.md
Your research goals, sample definitions, variables, and decisions—automatically loaded
Recent Progress
Your last 3 notebook commits with AI summaries—so it knows what you've accomplished
Dataset Schemas
Column names, types, statistics, and your comments—accessed via tools
Paper Content
Full-text search across your uploaded PDFs with equation and table extraction
Workspace Files
Notes, notebooks, and outputs—accessible via read/write tools
What the Research Agent Can Do
- Search papers: Semantically search your uploaded PDFs for methodology, definitions, and citations
- Read your data: Access your workspace files and dataset schemas, including comments and tags you've added
- Web search: Find relevant papers, documentation, and resources online
- Save findings: Write markdown notes to preserve discussion outcomes and methodology decisions
- Control notebooks: Read, edit, and execute cells in your running Marimo session—debug errors and iterate on code without leaving the chat
Uploading Papers
- 1. Go to the Research tab in your project
- 2. Click "Upload PDF" in the right sidebar
- 3. Select one or more PDF files
- 4. Wait for processing (status: pending → processing → ready)
Tip: Use publisher PDFs rather than scans for best results. The AI extracts equations, tables, and methodology sections.
Example Questions
- "How do they define the MAX variable in Bali et al.?"
- "What columns does my returns dataset have?"
- "Search for recent papers on momentum crash risk"
- "Save a summary of the methodology we discussed to notes"
The AI cites specific sections from your papers, making it easy to verify information.
Example Conversation
You:
"I want to study MAX but I only have monthly data. What are my options?"
AI:
"Looking at your dataset in data/returns_monthly.parquet... you have monthly returns from 2010-2023. For MAX without daily data, you could use Quarterly MAX (highest monthly return per quarter). Let me search for papers that validate this approach... Chen et al. (2019) found similar predictive power. Should I save these findings to notes? Or I can generate a notebook implementing this approach."
Execute
Launch Marimo in the cloud and run your notebooks. The Research Agent can debug and modify notebooks directly—no API key required.
Running Notebooks in the Cloud
- 1. Click Launch in the Execute tab to start Marimo
- 2. Open your notebook from the
/workspace/notebooks/folder - 3. Run cells with
Shift+Enteror click the play button - 4. Save your notebook (
Cmd/Ctrl+S) — files save directly to your workspace - 5. Commit to Timeline to track your progress
AI-Powered Debugging (Recommended)
When you encounter an error, ask the Research Agent to help. It can read your notebook, see the error, fix the code, and re-run the cell—all from the Research tab.
You: "I got a KeyError in cell 5, can you fix it?"
Research Agent: "I see the error—the column is 'permno' not 'PERMNO'. I've fixed cell 5 and re-executed it. Your Marimo notebook will automatically update."
Why this is better: The Research Agent has full project context—your papers, datasets, and methodology decisions. It understands why you're doing the analysis, not just what the code does.
Timeline & Commits
The Timeline tracks your research progress like git tracks code.
What is a Commit?
A commit represents a completed notebook and records:
- Inputs: What datasets were used
- Outputs: What datasets were created
- Message: What you did (your description or AI-generated)
- Notebook: Link to the executed notebook
Creating a Commit
- 1. Go to Timeline in your project
- 2. Click "New Commit"
- 3. Select the notebook you completed
- 4. Confirm or edit the detected inputs/outputs
- 5. Add a commit message (or let AI generate one)
- 6. Click "Create Commit"
Why This Matters
- Onboarding: New team members can understand months of work in minutes
- Reproducibility: Trace any result back to its source
- Documentation: AI summarizes every step automatically
Data Library
Upload, organize, and analyze your datasets with automatic profiling.
Uploading Data
Drag files into the Data Library or click "Upload".
What happens when you upload:
- 1. File is uploaded to secure cloud storage
- 2. Converted to Parquet format for efficient processing
- 3. Columns and types are detected automatically
- 4. Sample values are extracted for preview
Data Profiling
Click "Generate Insights" on any dataset to get:
- Column statistics (min, max, mean, percentiles)
- Panel structure analysis (entities, time periods, balance)
- Data quality indicators (missing values, outliers)
- Automatic ID detection (PERMNO, CUSIP, GVKEY, etc.)
Organization
- Folders: Organize datasets into folders (e.g., "raw", "processed")
- Tags: Classify datasets with custom tags for filtering
- Comments: Add notes about data sources or transformations
Workspace
Your project's file browser. Manage notebooks, notes, data files, and outputs.
Default Folders
Every project starts with three key folders:
notes/Markdown files for research notes, methodology discussions, and documentationnotebooks/Marimo notebooks (.py files) generated by the Research Agent and your executed analysesdata/Intermediate datasets, outputs, and processed files from your notebooks
You can create additional folders to organize your work however you prefer.
What You Can Do
- Browse files: Navigate folders in the left panel
- Preview: Click a file to see its contents in the right panel
- Upload: Drag files or click the upload button
- Download: Click the download icon on any file
- Create: Make new markdown notes or folders
For SAS/STATA Users
AI-First Workflow
Whether you're new to Python or an experienced programmer, Methodi accelerates your work by generating code from your descriptions. Your workflow is simple:
- 1. Describe your analysis in plain language
- 2. Run the generated code (Shift+Enter)
- 3. Verify the output matches your expectations
Key Differences
| SAS/STATA | Marimo |
|---|---|
| Run entire program | Run one cell at a time (dependent cells update automatically) |
| Log window | Output appears inline with each cell |
| Separate data/code | Data and code together in notebook |
| Licensed software | Python is free and open source |
Reading Python Output
DataFrames (like SAS datasets)
permno date ret 0 10001 2010-01-04 0.023451 1 10001 2010-01-05 -0.012345 2 10001 2010-01-06 0.008901
This is like PROC PRINT output.
Summary Statistics
count 524382.000000 mean 0.031245 std 0.042156 min -0.089234 50% 0.024567 max 0.234567
This is like PROC MEANS output.
Common Translations
| SAS | STATA | Python (pandas) |
|---|---|---|
| data work.new; | gen newvar = ... | df['newvar'] = ... |
| proc sort | sort | df.sort_values() |
| merge | merge | pd.merge() |
| proc means | summarize | df.describe() |
| proc reg | regress | sm.OLS() |
This table helps you read AI-generated code. You can also right-click any code cell and select "Explain" for a detailed breakdown.
Getting Help with Code
Use the Research Agent in the Research tab for AI assistance:
- "Fix cell 3" — The agent reads the error and fixes the code directly
- "Explain this code" — Get a plain-language explanation of what the code does
- "Improve this approach" — Suggest optimizations based on your methodology
The Research Agent understands your project context—papers, datasets, and decisions—so it can give better suggestions.
Running Notebooks in Marimo
Methodi runs Marimo in the cloud—no local Python installation required. Your notebooks execute on our servers with all standard data science packages pre-installed.
Cloud Sandbox Environment
When you launch Marimo from the Execute tab, we spin up a cloud sandbox with:
- Pre-installed packages: pandas, numpy, scipy, statsmodels, matplotlib, seaborn, scikit-learn, pyarrow, plotly
- Your datasets: Mounted at
/data/ - Persistent workspace: Notebooks and outputs saved at
/workspace/ - AI debugging: Research Agent can edit and execute cells from the Research tab (no API key needed)
- Reactive execution: Marimo automatically re-runs dependent cells when you make changes
Sessions timeout after 2 hours of inactivity. Your files persist across sessions.
Why Marimo?
Marimo is a next-generation Python notebook that fixes common pain points:
- No hidden state: What you see is what you get—no stale variables from deleted cells
- Pure Python files: Notebooks are stored as .py files, making them easy to version control and diff
- Built-in UI elements: Interactive sliders, dropdowns, and tables without extra libraries
- Reproducible by design: Run cells in any order and get the same result
The Most Important Skill: Shift+Enter
This runs the current cell. Marimo will automatically re-run any cells that depend on it.
| Shortcut | Action |
|---|---|
| Shift + Enter | Run cell |
| Ctrl + Enter | Run cell, stay focused |
| Ctrl + S | Save notebook |
The Basic Workflow
- 1. Read the markdown cell (understand what this step does)
- 2. Run the code cell (Shift+Enter or click the play button)
- 3. Check the output (Marimo shows results inline)
- 4. Continue to the next step—dependent cells update automatically
Getting Help
Use the Research Agent in the Research tab for AI assistance:
- Fix errors: "I got an error in cell 3, can you fix it?"
- Explain code: "What does this pandas operation do?"
- Edit cells: "Change cell 5 to use a log transformation instead"
The Research Agent has full project context—your papers, datasets, and methodology decisions. It can edit cells directly in your notebook.
If a Cell Gets Stuck
- 1. Click the Stop button in the toolbar (⏹️)
- 2. If needed, use the Interrupt Kernel option from the menu
- 3. Edit your code and try again—Marimo will only re-run affected cells
How Billing Works
Methodi uses a simple pay-as-you-go model. You only pay for what you use.
Free Credits
- $10.00 starting balance when you sign up
- $10.00 monthly refresh every 30 days
- No credit card required to start
Project-Based Billing
All usage within a project is billed to the project owner. This enables seamless team collaboration:
- Team members work without needing their own balance
- Centralized billing for easy expense tracking
- Project owners can monitor usage in Settings → Billing
What Counts as Usage?
- AI conversations: Research Agent chats and notebook generation
- Paper parsing: Converting PDFs to searchable content
- Notebook execution: Running JupyterLab in the cloud
Adding Funds
When your balance runs low, you can top up from Settings → Billing:
- Minimum top-up: $10.00
- Presets: $10, $25, $50, $100
- Auto-recharge: Automatically add funds when balance drops below $2.00
Note: A minimum balance of $0.50 is required to start any billable action. This prevents interrupted operations.
Pricing
Transparent, pay-as-you-go pricing. No hidden fees.
| Service | Price | Notes |
|---|---|---|
| AI Conversations | $0.015 / 1K input tokens $0.075 / 1K output tokens | Research, Plan, Execute agents |
| Paper Parsing | $0.10 / page | Scientific PDF extraction |
| Marimo Sandbox | $0.30 / hour | Cloud compute for notebooks |
| Storage | Free | Up to 100 GB per account |
Free Tier
Every account gets $10.00 starting balance plus $10.00 monthly refresh. Most users never need to add funds.
Typical Usage Examples
- $0.05A typical AI conversation (research question + response)
- $1.00Parsing a 10-page research paper
- $0.502 hours of Marimo notebook execution
- $0.20Generating a complete methodology notebook
Supported File Formats
| Type | Formats | Notes |
|---|---|---|
| Papers | Use publisher PDFs for best parsing | |
| Data | CSV, TSV, Excel (.xlsx, .xls) | Converted to Parquet for processing |
| Data (Preferred) | Parquet | 10x faster than CSV for large files |
| Notes | Markdown (.md) | Create and edit in Workspace |
| Notebooks | Marimo (.py) | Generated by Research Agent or created manually |
Tip: For large financial datasets (CRSP, Compustat), convert to Parquet before uploading. It's much faster to process.
Frequently Asked Questions
Do I need to know Python?
Not required. AI generates the code, so you can focus on your research. That said, Python experience helps you customize and extend what AI creates.
What if the AI-generated code has an error?
Ask the Research Agent to fix it: "I got an error in cell 5, can you fix it?" The agent can read your notebook, see the error, edit the cell, and re-execute it—all from the Research tab. No API key required.
Can I edit the generated code?
Yes! The code is fully editable. Modify it however you like. If you break something, regenerate the notebook.
What's the largest file I can upload?
5GB per file. For very large datasets, use Parquet format—it's much faster to process than CSV.
Can I use R instead of Python?
Not currently. Methodi generates Python code using pandas and statsmodels.
How do I share results with someone outside my team?
Export notebooks as PDF or HTML from Marimo's export options, or share the .py file directly since Marimo notebooks are pure Python.
External Resources
Learn more about Marimo and Python from these excellent resources.
Marimo Documentation
Python for Finance (Optional)
If you want to understand the code better: