sqrtspace-experiments/README.md

# The Ubiquity of Space-Time Tradeoffs: Experiments & Implementation

This repository contains the experimental code, case studies, and interactive dashboard accompanying the paper "The Ubiquity of Space-Time Simulation in Modern Computing: From Theory to Practice".

**Paper Repository**: [github.com/sqrtspace/sqrtspace-paper](https://github.com/sqrtspace/sqrtspace-paper)
**Interactive Dashboard**: Run locally with `streamlit run dashboard/app.py`
**Based on**: Ryan Williams' 2025 result that TIME[t] ⊆ SPACE[√(t log t)]

## Overview

This project demonstrates how theoretical space-time tradeoffs manifest in real-world systems through:
- **Controlled experiments** validating the √n relationship
- **Production system analysis** (PostgreSQL, Flash Attention, MapReduce)
- **Interactive visualizations** exploring memory hierarchies
- **Practical tools** for optimizing space-time tradeoffs

## Key Findings

- Theory predicts √n slowdown, practice shows 100-10,000× due to constant factors
- Memory hierarchy (L1/L2/L3/RAM/Disk) dominates performance
- Cache-friendly algorithms can be faster with less memory
- The √n pattern appears everywhere: database buffers, ML checkpointing, distributed systems

## Experiments

### 1. Maze Solver (C#)
**Location:** `experiments/maze_solver/`

Demonstrates graph traversal with memory constraints:
- BFS: O(n) memory, 1ms runtime
- Memory-Limited DFS: O(√n) memory, 5ms runtime (5× slower)

```bash
cd experiments/maze_solver
dotnet run
```

### 2. Checkpointed Sorting (Python)
**Location:** `experiments/checkpointed_sorting/`

Shows massive I/O penalties when reducing memory:
- In-memory: O(n) space, 0.0001s
- Checkpointed: O(√n) space, 0.268s (2,680× slower!)

```bash
cd experiments/checkpointed_sorting
python checkpointed_sort.py
```

### 3. Stream Processing (Python)
**Location:** `experiments/stream_processing/`

Reveals when less memory is actually faster:
- Full history: O(n) memory, 0.33s
- Sliding window: O(w) memory, 0.011s (30× faster!)

```bash
cd experiments/stream_processing
python sliding_window.py
```

## Case Studies

### Database Systems (`case_studies/database_systems.md`)
- PostgreSQL buffer pool sizing follows √(database_size)
- Query optimizer chooses algorithms based on available memory
- Hash joins (fast) vs nested loops (slow) show 200× performance difference

### Large Language Models (`case_studies/llm_transformers.md`)
- Flash Attention: O(n²) → O(n) memory for 10× longer contexts
- Gradient checkpointing: √n layers stored
- Quantization: 8× memory reduction for 2-3× slowdown

### Distributed Computing (`case_studies/distributed_computing.md`)
- MapReduce: Optimal shuffle buffer = √(data_per_node)
- Spark: Memory fraction settings control space-time tradeoffs
- Hierarchical aggregation naturally forms √n levels

## Quick Start

### Prerequisites
- Python 3.8+ (for Python experiments)
- .NET Core SDK (for C# maze solver)
- 2GB free memory for experiments

### Installation
```bash
# Clone repository
git clone https://github.com/sqrtspace/sqrtspace-experiments.git
cd Ubiquity

# Install Python dependencies
pip install -r requirements.txt

# Run the dashboard
streamlit run dashboard/app.py
```

### Running All Experiments
```bash
# Run each experiment
cd experiments/maze_solver && dotnet run && cd ../..
cd experiments/checkpointed_sorting && python checkpointed_sort.py && cd ../..
cd experiments/stream_processing && python sliding_window.py && cd ../..
```

## Repository Structure

```
├── experiments/           # Core experiments demonstrating tradeoffs
│   ├── maze_solver/      # C# graph traversal with memory limits
│   ├── checkpointed_sorting/  # Python external sorting
│   └── stream_processing/     # Python sliding window vs full storage
├── case_studies/         # Analysis of production systems
│   ├── database_systems.md
│   ├── llm_transformers.md
│   └── distributed_computing.md
├── dashboard/            # Interactive Streamlit visualizations
│   └── app.py           # 6-page interactive dashboard
├── SUMMARY.md           # Comprehensive findings
└── FINDINGS.md          # Experimental results analysis
```

## Interactive Dashboard

The dashboard (`dashboard/app.py`) includes:
1. **Space-Time Calculator**: Find optimal configurations
2. **Memory Hierarchy Simulator**: Visualize cache effects
3. **Algorithm Comparisons**: See tradeoffs in action
4. **LLM Optimizations**: Flash Attention demonstrations
5. **Production Examples**: Real-world case studies

## Measurement Framework

`experiments/measurement_framework.py` provides:
- Continuous memory monitoring (10ms intervals)
- Cache-aware benchmarking
- Statistical analysis across multiple runs
- Automated visualization generation

## Extending the Work

### Adding New Experiments
1. Create folder in `experiments/`
2. Implement space-time tradeoff variants
3. Use `measurement_framework.py` for profiling
4. Document findings in experiment README

### Contributing Case Studies
1. Analyze a system with space-time tradeoffs
2. Document the √n patterns you find
3. Add to `case_studies/` folder
4. Submit pull request

## Citation

If you use this code or build upon our work:

```bibtex
@article{friedel2025ubiquity,
  title={The Ubiquity of Space-Time Simulation in Modern Computing: From Theory to Practice},
  author={Friedel Jr., David H.},
  journal={arXiv preprint arXiv:25XX.XXXXX},
  year={2025}
}
```

## Contact

**Author**: David H. Friedel Jr.
**Organization**: MarketAlly LLC (USA) & MarketAlly Pte. Ltd. (Singapore)
**Email**: dfriedel@marketally.com

## License

This work is licensed under CC BY 4.0. You may share and adapt the material with proper attribution.

## Acknowledgments

- Ryan Williams for the theoretical foundation
- The authors of Flash Attention, PostgreSQL, and Apache Spark
- Early-stage R&D support from MarketAlly LLC and MarketAlly Pte. Ltd.