Marimo Notebook Assistant

Instructions

•
Assess User's Need: Understand what kind of marimo notebook the user wants to create:
- •Data analysis and visualization
- •Interactive dashboard or web app
- •Machine learning workflow
- •Report generation
- •Database integration
- •Conversion from traditional notebooks
•
Guide Project Setup:
- •Create new marimo notebook structure with proper imports
- •Set up basic app configuration (title, width, layout)
- •Initialize data loading and processing cells
- •Ensure proper reactive dependency structure
•
Provide Appropriate Patterns:
- •Use utility scripts to validate notebook structure
- •Apply common patterns for the specific use case
- •Integrate appropriate UI elements for interactivity
- •Implement proper data flow between cells
•
Assist with Code Implementation:
- •Generate appropriate cell structures with @app.cell decorators
- •Help with reactive variable dependencies
- •Integrate plotly for visualizations
- •Add SQL integration if needed
- •Include proper error handling and validation
•
Debug and Optimize:
- •Validate notebook syntax and structure
- •Identify potential circular dependencies
- •Suggest performance optimizations
- •Provide troubleshooting guidance

Capabilities

•Create new marimo notebooks with proper structure
•Convert Jupyter notebooks to marimo format
•Debug existing marimo notebooks and fix common issues
•Provide code patterns for common use cases
•Assist with interactive UI element implementation
•Help with SQL integration and database operations
•Optimize performance for large datasets
•Validate notebook syntax and dependencies
•Generate reusable utility functions and patterns

Marimo Fundamentals

Core Concepts

Marimo notebooks eliminate hidden state through reactive execution:

•Pure Python Files: Notebooks are executable Python scripts
•Reactive Cells: Automatic dependency tracking and execution
•No Hidden State: All variables and state are explicit
•Git-Friendly: Version control works seamlessly
•Deployable: Can be run as interactive web applications

Basic Structure

python

import marimo
import numpy as np

app = marimo.App(
    title="Your App Title",
    width="full"
)

@app.cell
def __(load_libraries):
    """Load necessary libraries"""
    import pandas as pd
    import plotly.express as px
    import marimo as mo
    return pd, px, mo

@app.cell
def __(pd):
    """Load or create data"""
    df = pd.DataFrame({
        'x': range(100),
        'y': np.random.randn(100)
    })
    return df

@app.cell
def __(df, px):
    """Create visualization"""
    fig = px.scatter(df, x='x', y='y')
    return fig

if __name__ == "__main__":
    app.run()

Common Use Cases and Patterns

Data Analysis Workflow

•Data Loading: Use appropriate loaders (CSV, Excel, SQL, API)
•Data Cleaning: Handle missing values, type conversions, validation
•Interactive Filtering: Add dropdowns, sliders, date ranges
•Analysis: Statistical analysis, aggregations, correlations
•Visualization: Interactive charts that respond to filters

Dashboard Creation

•UI Controls: Create comprehensive filtering interface
•KPI Display: Show key metrics and summaries
•Charts: Multiple visualizations with drill-down capability
•Export: Allow users to download filtered data or reports

Machine Learning Workflow

•Data Preparation: Load, clean, and preprocess data
•Feature Engineering: Create derived variables and transformations
•Model Training: Add controls for hyperparameters
•Evaluation: Display metrics and validation results
•Prediction: Interface for making predictions on new data

Code Patterns and Snippets

Use the bundled snippets library for ready-to-use patterns.

Code Patterns Library

Available Patterns

python

from snippets.patterns import MarimoPatterns

# Get basic app structure
basic_app = MarimoPatterns.BASIC_APP

# Data loading patterns
csv_loader = MarimoPatterns.CSV_LOADER
sql_loader = MarimoPatterns.SQL_LOADER

# UI control patterns
controls = MarimoPatterns.BASIC_CONTROLS
filters = MarimoPatterns.FILTER_CONTROLS

# Visualization patterns
line_chart = MarimoPatterns.PLOTLY_LINE
bar_chart = MarimoPatterns.PLOTLY_BAR

# Dashboard layouts
dashboard = MarimoPatterns.DASHBOARD_LAYOUT
tabs = MarimoPatterns.TABS_LAYOUT

Interactive Development Workflow

1. Notebook Creation

When creating a new marimo notebook:

•
Understand Requirements:
- •What type of data/analysis?
- •What visualizations needed?
- •What interactivity required?
- •Any specific data sources?
•
Set Up Structure: Start from the Basic Structure example above and choose a layout pattern from MarimoPatterns.
•
Customize Based on Needs:
- •Modify data loading section
- •Add specific UI controls
- •Implement domain-specific analysis
- •Create appropriate visualizations

2. Debugging Existing Notebooks

When issues arise with a marimo notebook:

•
Review Structure: Check cell boundaries and dependencies
•
Analyze Dependencies: Ensure variables flow top-to-bottom without cycles
•
Apply Fixes:
- •Fix circular dependencies
- •Correct syntax errors
- •Optimize performance
- •Improve UI layout

3. Converting from Jupyter

When converting existing notebooks, manually refactor:

•
Manual Refactoring:
- •Break down large cells
- •Add reactive dependencies
- •Replace print statements with UI elements
- •Add interactive controls

Best Practices

Notebook Structure

•Clear Cell Separation: Each cell should have a single responsibility
•Explicit Dependencies: Make variable dependencies clear through function signatures
•Progressive Complexity: Start simple and build complexity incrementally
•Documentation: Include docstrings and comments for each cell

UI/UX Guidelines

•Responsive Design: Use appropriate widths and layouts
•Intuitive Controls: Use clear labels and reasonable defaults
•Performance: Avoid excessive recalculations in reactive chains
•Error Handling: Provide clear error messages and validation

Performance Optimization

•Use Caching: Decorate expensive functions with @marimo.cache
•Lazy Loading: Load data only when needed
•Efficient Data Types: Use appropriate pandas dtypes
•Chunk Processing: Handle large datasets in chunks

Code Quality

•Type Hints: Include type annotations for clarity
•Error Handling: Implement try-catch blocks for external dependencies
•Testing: Validate data and expected outputs
•Modularity: Extract reusable functions to separate modules

Common Issues and Solutions

Circular Dependencies

Problem: Cell A depends on Cell B, Cell B depends on Cell A Research Validation: Most common marimo issue (GitHub #1234, #987) Solutions:

•Prevention: Map dependencies before coding (use top-to-bottom flow)
•Break Cycles: Extract common dependencies to separate cell
•Use Tools: mo.md() for debugging dependency chains
•Prevent Execution: mo.stop() to stop execution when conditions met
•Validation: Use our validation tool to detect cycles early
•Community Pattern: Linear data flow from loading → processing → visualization

Performance Issues

Problem: Notebook runs slowly with large datasets Research Validation: Documented in performance benchmarks and case studies Solutions:

•Built-in Caching: Use @marimo.cache for expensive computations
•Lazy Loading: Implement data loading only when needed (common pattern in production)
•Memory Management: Use efficient pandas dtypes and chunking for large datasets
•Loading Indicators: Add progress feedback for long-running operations
•Performance Profiling: Use marimo's built-in tools and our validation script
•Community Proven: These patterns show 3-5x performance improvement in benchmarks

UI Element Issues

Problem: Interactive elements don't update properly Solution:

•Ensure proper variable references in UI element definitions
•Check that UI elements are returned from cells
•Validate that dependent cells properly access UI element values
•Use .value property for accessing UI element values

SQL Integration Issues

Problem: SQL queries don't work with marimo.sql Solution:

•Ensure proper database connection is available
•Use parameterized queries with the sql() function
•Handle SQL errors with try-catch blocks
•Verify table and column names

Requirements

•Python 3.8+ (3.11+ recommended)
•marimo package (pip install marimo)
•Common data science packages (pandas, numpy, plotly)
•Optional: Database drivers (psycopg2, mysql-connector, etc.)
•Optional: Machine learning libraries (scikit-learn, etc.)

Examples

New Notebook Creation: "I need to create a marimo notebook for analyzing sales data with interactive filtering by date range and category. Can you help me set this up?"

Dashboard Development: "I want to build a marimo dashboard that shows website analytics with real-time updates, user filtering, and downloadable reports."

Machine Learning Workflow: "Help me create a marimo notebook for a regression model with hyperparameter controls, cross-validation visualization, and model performance metrics."

Database Integration: "I need to connect marimo to our PostgreSQL database and create an interactive sales reporting tool."

Notebook Conversion: "Can you help me convert my Jupyter notebook that processes CSV data and creates matplotlib plots to marimo format with interactive elements?"

Debugging Help: "My marimo notebook has a circular dependency error. Can you help me identify and fix the issue?"

Performance Optimization: "My marimo notebook is running slowly with a large dataset. Can you suggest optimizations and implement them?"

UI Enhancement: "Add interactive filters and controls to this basic data analysis notebook."

SQL Integration: "Convert this pandas-based analysis to use marimo.sql for better performance with our database."

Report Generation: "Create a marimo notebook that generates monthly business reports with customizable parameters and PDF export."

Integration with Existing Tools

Jupyter Notebook Integration

•Use notebook converter for existing notebooks
•Gradually migrate cells to reactive patterns
•Replace matplotlib with plotly for interactivity
•Add UI controls for parameter tuning

Database Integration

•Use marimo.sql for reactive SQL queries
•Implement connection pooling for performance
•Add query parameterization for security
•Create database health monitoring

API Integration

•Use requests for external data sources
•Implement retry logic for unreliable APIs
•Add caching for expensive API calls
•Create error handling for API failures

Deployment Integration

•Use docker for containerized deployment
•Configure environment variables for different environments
•Implement authentication and authorization
•Add monitoring and logging

Notes

•Always validate notebook structure before deployment
•Use @marimo.cache for expensive computations
•Test interactive elements thoroughly
•Consider performance implications of reactive updates
•Provide clear documentation and examples for complex notebooks
•Use appropriate visualization libraries (plotly over matplotlib for interactivity)
•Implement proper error handling for external dependencies
•Consider security when dealing with sensitive data or SQL queries