# The Ubiquity of Space-Time Tradeoffs: Experiments & Implementation

This repository contains the experimental code, case studies, and interactive dashboard accompanying the paper "The Ubiquity of Space-Time Simulation in Modern Computing: From Theory to Practice".

**Paper Repository**: [github.com/sqrtspace/sqrtspace-paper](https://github.com/sqrtspace/sqrtspace-paper)

**Interactive Dashboard**: Run locally with `streamlit run dashboard/app.py`

**Based on**: Ryan Williams' 2025 result that TIME[t] ⊆ SPACE[√(t log t)]

## Overview

This project demonstrates how theoretical space-time tradeoffs manifest in real-world systems through:
- **Controlled experiments** validating the √n relationship
- **Production system analysis** (PostgreSQL, Flash Attention, MapReduce)
- **Interactive visualizations** exploring memory hierarchies
- **Practical tools** for optimizing space-time tradeoffs

## Key Findings

- Theory predicts a √n slowdown; in practice, slowdowns reach 100× to 10,000× because of constant factors
- The memory hierarchy (L1/L2/L3 cache, RAM, disk) dominates performance
- Cache-friendly algorithms can run faster while using less memory
- The √n pattern appears everywhere: database buffers, ML checkpointing, distributed systems

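To give a sense of scale for the bound TIME[t] ⊆ SPACE[√(t log t)], the sketch below evaluates √(t log t) for a few values of t. The function name `sqrt_space_bound` and the base-2 logarithm are assumptions made for illustration; the theorem is asymptotic and hides constant factors, so the numbers indicate growth rate only.

```python
import math


def sqrt_space_bound(t: int) -> float:
    """Return sqrt(t * log2(t)), the growth rate of the space bound.

    Constant factors are ignored and the logarithm base is an assumption.
    """
    if t < 2:
        raise ValueError("t must be at least 2")
    return math.sqrt(t * math.log2(t))


# Compare the number of time steps with the corresponding space bound.
for t in (10**3, 10**6, 10**9):
    print(f"t = {t:>13,}  ->  sqrt(t log t) ~ {sqrt_space_bound(t):,.0f}")
```

The gap between the two columns widens as t grows, which is the sense in which a time-t computation can be simulated in far less than t space.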
## Experiments

### 1. Maze Solver (C#)
**Location:** `experiments/maze_solver/`

Demonstrates graph traversal with memory constraints:
- BFS: O(n) memory, 1 ms runtime
- Memory-Limited DFS: O(√n) memory, 5 ms runtime (5× slower)

```bash
cd experiments/maze_solver
dotnet run
```

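The contrast between the two traversals can be sketched in a few lines of Python. This is not the repository's C# implementation: the memory-limited variant here is iterative-deepening DFS, swapped in as a well-known low-memory search, and `MAZE`, `bfs_shortest`, and `iddfs_shortest` are hypothetical names. BFS records every visited cell, while the deepening search stores only the current path. That path is about √n cells long on an open square grid of n cells, but it can be longer in a winding maze.

```python
from collections import deque

# Hypothetical 5x5 maze: S = start, E = exit, # = wall.
MAZE = [
    "S.#..",
    ".##.#",
    "....#",
    ".#...",
    ".#.#E",
]


def find(ch):
    """Return the (row, col) of the first cell containing ch."""
    for r, row in enumerate(MAZE):
        c = row.find(ch)
        if c != -1:
            return (r, c)
    raise ValueError(f"{ch!r} not in maze")


def neighbors(cell):
    """Yield the open cells adjacent to cell."""
    r, c = cell
    for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
        nr, nc = r + dr, c + dc
        if 0 <= nr < len(MAZE) and 0 <= nc < len(MAZE[0]) and MAZE[nr][nc] != "#":
            yield (nr, nc)


def bfs_shortest(start, goal):
    """BFS: remembers every visited cell, so memory grows with maze size."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        cell = queue.popleft()
        if cell == goal:
            return dist[cell]
        for nxt in neighbors(cell):
            if nxt not in dist:
                dist[nxt] = dist[cell] + 1
                queue.append(nxt)
    return None


def iddfs_shortest(start, goal):
    """Iterative-deepening DFS: stores only the current path, revisits cells."""

    def depth_limited(path, on_path, limit):
        cell = path[-1]
        if cell == goal:
            return True
        if limit == 0:
            return False
        for nxt in neighbors(cell):
            if nxt not in on_path:  # avoid cycles along the current path only
                path.append(nxt)
                on_path.add(nxt)
                if depth_limited(path, on_path, limit - 1):
                    return True
                on_path.discard(path.pop())
        return False

    # The first depth limit at which the goal is reachable is the shortest distance.
    for limit in range(len(MAZE) * len(MAZE[0])):
        if depth_limited([start], {start}, limit):
            return limit
    return None


start, goal = find("S"), find("E")
print("BFS shortest path:  ", bfs_shortest(start, goal))
print("IDDFS shortest path:", iddfs_shortest(start, goal))
```

Both functions return the same path length; the deepening search pays for its smaller footprint by re-exploring cells on every pass, which is the time side of the tradeoff.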
### 2. Checkpointed Sorting (Python)
**Location:** `experiments/checkpointed_sorting/`

Shows the large I/O penalty incurred when memory is reduced:
- In-memory: O(n) space, 0.0001 s
- Checkpointed: O(√n) space, 0.268 s (2,680× slower!)

```bash
cd experiments/checkpointed_sorting
python checkpointed_sort.py
```

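One way to obtain O(√n) working memory for sorting is an external merge sort with runs of about √n elements. The sketch below is an assumption about the general technique, not a copy of `checkpointed_sort.py`; the function names are hypothetical, and the input list stands in for a data source that would not fit in memory. Each run is sorted and written to a temporary file, then `heapq.merge` streams the runs back while holding one element per run.

```python
import heapq
import math
import os
import random
import tempfile


def _read_run(path):
    """Lazily yield the integers stored one per line in a run file."""
    with open(path) as f:
        for line in f:
            yield int(line)


def checkpointed_sort(values):
    """Yield the integers in values in sorted order using ~sqrt(n)-sized runs.

    Only one run (about sqrt(n) items) is sorted in memory at a time, and the
    merge keeps one pending item per run (about sqrt(n) runs).
    """
    n = len(values)
    chunk = max(1, math.isqrt(n))
    with tempfile.TemporaryDirectory() as tmp:
        run_paths = []
        for i in range(0, n, chunk):
            run = sorted(values[i:i + chunk])
            path = os.path.join(tmp, f"run_{len(run_paths)}.txt")
            with open(path, "w") as f:
                f.writelines(f"{v}\n" for v in run)
            run_paths.append(path)
        # k-way merge over lazily read run files.
        yield from heapq.merge(*(_read_run(p) for p in run_paths))


data = [random.randrange(1_000_000) for _ in range(1_000)]
result = list(checkpointed_sort(data))
print("sorted correctly:", result == sorted(data))
```

Every element is written to disk and read back once, which is where the I/O penalty in the measurements above comes from.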
### 3. Stream Processing (Python)
**Location:** `experiments/stream_processing/`

Shows a case where using less memory is actually faster:
- Full history: O(n) memory, 0.33 s
- Sliding window: O(w) memory, 0.011 s (30× faster!)

```bash
cd experiments/stream_processing
python sliding_window.py
```

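The two strategies can be illustrated with a moving average over the last `w` values of a stream. The workload and function names below are assumptions chosen for illustration and may differ from what `sliding_window.py` measures. The full-history version keeps every value and re-sums the window each step; the sliding-window version keeps only `w` values and a running total.

```python
from collections import deque


def full_history_means(stream, w):
    """O(n) memory: store the whole stream and re-sum the last w values."""
    history = []
    means = []
    for x in stream:
        history.append(x)
        window = history[-w:]
        means.append(sum(window) / len(window))
    return means


def sliding_window_means(stream, w):
    """O(w) memory: keep only the current window and a running total."""
    window = deque()
    total = 0
    for x in stream:
        window.append(x)
        total += x
        if len(window) > w:
            total -= window.popleft()  # drop the value that left the window
        yield total / len(window)


values = list(range(1, 11))
print(full_history_means(values, 3))
print(list(sliding_window_means(values, 3)))
```

The sliding-window version does constant work per element and touches a small, cache-resident buffer, which is one plausible reason the smaller-memory variant wins here.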
## Case Studies

### Database Systems (`case_studies/database_systems.md`)
- PostgreSQL buffer pool sizing follows √(database_size)
- Query optimizer chooses algorithms based on available memory
- Hash joins (fast) vs nested loops (slow) show 200× performance difference

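The hash join versus nested loop contrast is a space-time tradeoff in miniature: the hash join spends memory on a lookup table to avoid comparing every pair of rows. The sketch below is illustrative Python, not PostgreSQL's executor; the row layout `(key, payload)` and the function names are assumptions.

```python
from collections import defaultdict


def nested_loop_join(left, right):
    """No extra memory, but compares every left row with every right row."""
    return [(l, r) for l in left for r in right if l[0] == r[0]]


def hash_join(left, right):
    """Builds a hash table on the left input, then probes it once per right row."""
    table = defaultdict(list)
    for l in left:
        table[l[0]].append(l)
    return [(l, r) for r in right for l in table.get(r[0], [])]


# Hypothetical rows of the form (join_key, payload).
users = [(1, "ada"), (2, "bob"), (3, "cy")]
orders = [(2, "book"), (3, "pen"), (3, "ink"), (4, "cup")]

print(sorted(nested_loop_join(users, orders)))
print(sorted(hash_join(users, orders)))
```

Both joins produce the same pairs (in a different order); the nested loop does work proportional to the product of the input sizes, while the hash join does work proportional to their sum plus the output size, at the cost of holding one input in memory.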
### Large Language Models (`case_studies/llm_transformers.md`)
- Flash Attention: O(n²) → O(n) memory for 10× longer contexts
- Gradient checkpointing: √n layers stored
- Quantization: 8× memory reduction for 2-3× slowdown

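The √n figure for gradient checkpointing comes from keeping one activation every k ≈ √n layers and recomputing the rest on demand. The toy below uses integer-valued functions in place of real network layers, and every name in it is hypothetical; it is a sketch of the idea, not the API of any ML framework.

```python
import math


def make_layers(n):
    """Hypothetical toy 'layers': cheap integer functions standing in for real ones."""
    return [lambda x, i=i: 3 * x + i for i in range(n)]


def forward_with_checkpoints(layers, x):
    """Run all layers, storing only the input to every k-th layer (k ~ sqrt(n))."""
    k = max(1, math.isqrt(len(layers)))
    checkpoints = {0: x}  # layer index -> activation entering that layer
    for i, layer in enumerate(layers):
        x = layer(x)
        if (i + 1) % k == 0:
            checkpoints[i + 1] = x
    return x, checkpoints, k


def activation_at(layers, checkpoints, k, i):
    """Recompute the input to layer i from the nearest earlier checkpoint."""
    start = (i // k) * k
    x = checkpoints[start]
    for j in range(start, i):  # at most k - 1 layers are re-run
        x = layers[j](x)
    return x


layers = make_layers(16)
output, checkpoints, k = forward_with_checkpoints(layers, 1)
print("stored activations:", len(checkpoints), "of", len(layers) + 1)
print("input to layer 10:", activation_at(layers, checkpoints, k, 10))
```

Storage drops from n + 1 activations to roughly √n + 1, and any missing activation costs at most about √n extra layer evaluations to rebuild.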
### Distributed Computing (`case_studies/distributed_computing.md`)
- MapReduce: Optimal shuffle buffer = √(data_per_node)
- Spark: Memory fraction settings control space-time tradeoffs
- Hierarchical aggregation naturally forms √n levels

## Quick Start

### Prerequisites
- Python 3.8+ (for Python experiments)
- .NET Core SDK (for C# maze solver)
- 2 GB free memory for experiments

### Installation
```bash
# Clone repository (git creates a directory named after the repository)
git clone https://github.com/sqrtspace/sqrtspace-experiments.git
cd sqrtspace-experiments

# Install Python dependencies
pip install -r requirements.txt

# Run the dashboard
streamlit run dashboard/app.py
```

### Running All Experiments
```bash
# Run each experiment in a subshell so the working directory is restored even if a run fails
(cd experiments/maze_solver && dotnet run)
(cd experiments/checkpointed_sorting && python checkpointed_sort.py)
(cd experiments/stream_processing && python sliding_window.py)
```

## Repository Structure

```
├── experiments/                  # Core experiments demonstrating tradeoffs
│   ├── maze_solver/              # C# graph traversal with memory limits
│   ├── checkpointed_sorting/     # Python external sorting
│   └── stream_processing/        # Python sliding window vs full storage
├── case_studies/                 # Analysis of production systems
│   ├── database_systems.md
│   ├── llm_transformers.md
│   └── distributed_computing.md
├── dashboard/                    # Interactive Streamlit visualizations
│   └── app.py                    # 6-page interactive dashboard
├── SUMMARY.md                    # Comprehensive findings
└── FINDINGS.md                   # Experimental results analysis
```

## Interactive Dashboard

The dashboard (`dashboard/app.py`) includes:
1. **Space-Time Calculator**: Find optimal configurations
2. **Memory Hierarchy Simulator**: Visualize cache effects
3. **Algorithm Comparisons**: See tradeoffs in action
4. **LLM Optimizations**: Flash Attention demonstrations
5. **Production Examples**: Real-world case studies

## Measurement Framework

`experiments/measurement_framework.py` provides:
- Continuous memory monitoring (10 ms intervals)
- Cache-aware benchmarking
- Statistical analysis across multiple runs
- Automated visualization generation

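Continuous memory monitoring at a fixed interval can be approximated with the standard library alone. The sketch below is not the interface of `measurement_framework.py`; `sample_memory` is a hypothetical helper that runs a function while a background thread records `tracemalloc` readings every 10 ms. Note that `tracemalloc` only sees allocations made through Python's allocator, so native buffers are not counted.

```python
import threading
import time
import tracemalloc


def sample_memory(func, interval=0.01):
    """Run func() while sampling traced Python memory every `interval` seconds.

    Returns (result, samples, peak), where samples is a list of
    (timestamp, bytes_in_use) pairs and peak is the peak traced size in bytes.
    """
    samples = []
    stop = threading.Event()

    def sampler():
        while True:
            current, _peak = tracemalloc.get_traced_memory()
            samples.append((time.perf_counter(), current))
            if stop.wait(interval):  # returns True once stop is set
                break

    tracemalloc.start()
    thread = threading.Thread(target=sampler, daemon=True)
    thread.start()
    try:
        result = func()
    finally:
        stop.set()
        thread.join()
        _current, peak = tracemalloc.get_traced_memory()
        tracemalloc.stop()
    return result, samples, peak


def workload():
    """Hypothetical workload: build a list, then pause briefly."""
    data = list(range(200_000))
    time.sleep(0.05)
    return len(data)


result, samples, peak = sample_memory(workload)
print(f"result={result}, samples={len(samples)}, peak={peak / 1e6:.1f} MB")
```

Sampling in a thread keeps the measured code unchanged, at the cost of missing spikes shorter than the sampling interval; the `peak` value from `tracemalloc` covers that gap for traced allocations.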
## Extending the Work

### Adding New Experiments
1. Create a folder in `experiments/`
2. Implement the space-time tradeoff variants
3. Use `measurement_framework.py` for profiling
4. Document findings in the experiment's README

### Contributing Case Studies
1. Analyze a system with space-time tradeoffs
2. Document the √n patterns you find
3. Add it to the `case_studies/` folder
4. Submit a pull request

## Citation

If you use this code or build upon our work:

```bibtex
@article{friedel2025ubiquity,
  title={The Ubiquity of Space-Time Simulation in Modern Computing: From Theory to Practice},
  author={Friedel Jr., David H.},
  journal={arXiv preprint arXiv:25XX.XXXXX},
  year={2025}
}
```

## Contact

**Author**: David H. Friedel Jr.

**Organization**: MarketAlly LLC (USA) & MarketAlly Pte. Ltd. (Singapore)

**Email**: dfriedel@marketally.com

## License

This work is licensed under CC BY 4.0. You may share and adapt the material with proper attribution.

## Acknowledgments

- Ryan Williams for the theoretical foundation
- The authors of Flash Attention, PostgreSQL, and Apache Spark
- Early-stage R&D support from MarketAlly LLC and MarketAlly Pte. Ltd.