
simple-code-execution


A Python library for executing code predictions in subprocesses, with parallel processing, automatic file management, and structured result handling.

πŸš€ Features

  • ⚑ Parallel Execution: Execute multiple code predictions simultaneously using multiprocessing
  • πŸ“ Automatic File Management: Write, execute, and cleanup temporary files seamlessly
  • πŸ›‘οΈ Robust Error Handling: Built-in timeout handling, syntax error detection, and graceful failure recovery
  • βš™οΈ Flexible Configuration: Comprehensive configuration options for execution behavior
  • πŸ”„ Processing Pipeline: Powerful preprocessing and postprocessing pipeline for custom workflows
  • πŸ“Š Resource Monitoring: Memory and CPU usage monitoring with configurable limits
  • 🎯 Production Ready: Battle-tested for large-scale code execution workloads

πŸ“¦ Installation

pip install simple-code-execution

Requirements

  • Python 3.10+
  • psutil >= 5.9
  • numpy >= 1.26
  • aiofiles >= 22.1.0
  • tqdm >= 4.60.0
  • ujson >= 5.10.0

πŸ”₯ Quick Start

from code_execution import ExecutionConfig, execute_predictions, Executable, Command

# Define your code predictions
predictions = [
    {"id": 1, "code": "print('Hello, World!')"},
    {"id": 2, "code": "x = 5\nprint(x * 2)"},
    {"id": 3, "code": "import math\nprint(math.sqrt(16))"},
]

# Configure execution settings
config = ExecutionConfig(
    num_workers=2,           # Number of parallel workers
    default_timeout=10,      # Timeout in seconds
    max_execute_at_once=3    # Max concurrent executions
)

# Define preprocessor: converts predictions to executable commands
def preprocessor(prediction):
    return Executable(
        files={"main.py": prediction["code"]},  # Files to write
        commands=[Command(command=["python3", "main.py"])],  # Commands to run
        tracked_files=[]  # Files to read back after execution
    )

# Define postprocessor: processes execution results
def postprocessor(prediction, result):
    return {
        "id": prediction["id"],
        "code": prediction["code"],
        "output": result.command_results[0].stdout,
        "success": result.command_results[0].return_code == 0,
        "runtime": result.command_results[0].runtime
    }

# Execute all predictions
results = execute_predictions(
    config=config,
    pred_list=predictions,
    preprocessor=preprocessor,
    postprocessor=postprocessor
)

# Print results
for result in results.results:
    print(f"ID: {result['id']}")
    print(f"Output: {result['output'].strip()}")
    print(f"Success: {result['success']}")
    print(f"Runtime: {result['runtime']:.3f}s")
    print("-" * 40)

Output:

ID: 1
Output: Hello, World!
Success: True
Runtime: 0.045s
----------------------------------------
ID: 2
Output: 10
Success: True
Runtime: 0.043s
----------------------------------------
ID: 3
Output: 4.0
Success: True
Runtime: 0.051s
----------------------------------------

πŸ—οΈ Architecture

The library follows a simple but powerful workflow:

  1. Preprocess β†’ Convert your data into Executable objects
  2. Execute β†’ Run code in parallel with resource management
  3. Postprocess β†’ Combine results with original predictions
graph LR
    A[Predictions] --> B[Preprocessor]
    B --> C[Executor]
    C --> D[Postprocessor]
    D --> E[Results]
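
The three stages map onto plain stdlib calls. The sketch below shows the shape of the workflow with `subprocess` and `tempfile`; it is illustrative only, not the library's actual implementation:

```python
import os
import subprocess
import sys
import tempfile

def preprocess(pred):
    # Turn a prediction dict into files to write and commands to run.
    return {"files": {"main.py": pred["code"]},
            "commands": [[sys.executable, "main.py"]]}

def execute(spec, timeout=10):
    # Write the files into a scratch directory, run each command, collect results.
    with tempfile.TemporaryDirectory() as cwd:
        for name, content in spec["files"].items():
            with open(os.path.join(cwd, name), "w") as fh:
                fh.write(content)
        results = []
        for cmd in spec["commands"]:
            proc = subprocess.run(cmd, cwd=cwd, capture_output=True,
                                  text=True, timeout=timeout)
            results.append({"stdout": proc.stdout, "returncode": proc.returncode})
        return results

def postprocess(pred, command_results):
    # Join the execution results back with the original prediction.
    first = command_results[0]
    return {"id": pred["id"],
            "output": first["stdout"].strip(),
            "success": first["returncode"] == 0}

predictions = [{"id": 1, "code": "print(2 + 2)"}]
results = [postprocess(p, execute(preprocess(p))) for p in predictions]
```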

βš™οΈ Configuration

config = ExecutionConfig(
    num_workers=4,              # Parallel workers
    default_timeout=30,         # Default timeout per command
    max_execute_at_once=10,     # Max concurrent executions
    write_rate_limit=768,       # File writing rate limit
    display_write_progress=True # Show progress bars
)
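
`default_timeout` bounds how long each command may run. The underlying semantics resemble the stdlib's own subprocess timeout, sketched below (stdlib only, not the library's internals): the child process is terminated once the limit is exceeded and the caller sees a timeout rather than a hang.

```python
import subprocess
import sys

try:
    subprocess.run(
        [sys.executable, "-c", "while True: pass"],  # never terminates on its own
        timeout=1,  # analogous to a 1-second default_timeout
    )
    timed_out = False
except subprocess.TimeoutExpired:
    timed_out = True  # the child was killed and the timeout surfaced here
print("timed out:", timed_out)
```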

🎯 Use Cases

  • Code Generation Evaluation: Test AI-generated code at scale
  • Competitive Programming: Run solutions against test cases
  • Code Analysis: Execute and analyze code behavior
  • Educational Tools: Safe code execution in learning environments
  • Research: Large-scale code execution experiments

⚑ Advanced Features

Multiple Commands per Prediction

def multi_command_preprocessor(prediction):
    return Executable(
        files={
            "setup.py": "# Setup code",
            "main.py": prediction["code"]
        },
        commands=[
            Command(command=["python3", "setup.py"]),
            Command(command=["python3", "main.py"], timeout=5)
        ],
        tracked_files=["output.txt"]  # Read this file after execution
    )
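
The `tracked_files=["output.txt"]` line means the named files are read back from the working directory after the commands finish. A stdlib sketch of that idea (illustrative, not the library's internals):

```python
import os
import subprocess
import sys
import tempfile

with tempfile.TemporaryDirectory() as cwd:
    # The executed code writes a file as a side effect.
    with open(os.path.join(cwd, "main.py"), "w") as fh:
        fh.write("open('output.txt', 'w').write('42')")
    subprocess.run([sys.executable, "main.py"], cwd=cwd, check=True)
    # "Tracking" a file: read it back into the result after execution.
    tracked = {
        name: open(os.path.join(cwd, name)).read()
        for name in ["output.txt"]
    }
print(tracked)
```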

Custom Early Stopping

def custom_early_stop(cmd_idx, result):
    # Stop if command fails
    if result.return_code != 0:
        return True
    # Stop if output contains error
    if "error" in result.stdout.lower():
        return True
    return False

executable = Executable(
    files={"test.py": code},
    commands=[Command(command=["python3", "test.py"])],
    should_early_stop=custom_early_stop
)

⚠️ Important Notes

Pickleable Functions Required

Both preprocessor and postprocessor functions must be pickleable (serializable) for multiprocessing:

βœ… Good:

def my_preprocessor(prediction):
    return Executable(...)

❌ Bad:

# Lambda - not pickleable
preprocessor = lambda pred: Executable(...)

# Nested function - not pickleable
def outer():
    def preprocessor(pred):
        return Executable(...)
    return preprocessor
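
The difference is easy to verify with `pickle` directly. This is a stdlib demonstration, independent of the library: pickle serializes functions by reference to their qualified name, which fails for lambdas.

```python
import pickle

def top_level_preprocessor(pred):
    # Defined at module top level, so pickle can find it by name.
    return pred

lambda_preprocessor = lambda pred: pred  # qualname is '<lambda>': unresolvable

pickle.dumps(top_level_preprocessor)  # succeeds

lambda_failed = False
try:
    pickle.dumps(lambda_preprocessor)
except (pickle.PicklingError, AttributeError):
    lambda_failed = True
print("lambda pickling failed:", lambda_failed)
```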

πŸ“š Documentation

🀝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.

Development Setup

git clone https://github.com/gabeorlanski/simple-code-execution.git
cd simple-code-execution
pip install -e .
pip install -r docs/requirements.txt

# Run tests
pytest

# Build documentation locally
cd docs
make html
make serve  # Serves at http://localhost:8000

πŸ“„ License

This project is licensed under the Apache 2.0 License - see the LICENSE file for details.

πŸ™ Acknowledgments

  • Built for reliable, scalable code execution in research and production environments
  • Designed with safety and resource management as core principles
  • Optimized for both single-use scripts and long-running services

Made with ❀️ by Gabriel Orlanski
