4.3 KiB
Mathstream Library
mathstream offers streamed, string-based arithmetic for very large integers that you may not want to load entirely into memory. Instead of parsing numbers into Python int values, you work with digit files on disk via StreamNumber and call math operations that operate chunk-by-chunk.
Quick Start
python -m venv venv
source venv/bin/activate
pip install -e .
Create digit files anywhere you like (the examples below use instance/log), or supply ad-hoc literals, then construct StreamNumber objects and call the helpers:
from mathstream import (
StreamNumber,
add,
sub,
mul,
div,
mod,
pow,
is_even,
is_odd,
free_stream,
collect_garbage,
)
a = StreamNumber("instance/log/huge.txt")
b = StreamNumber(literal="34567")
e = StreamNumber(literal="3")
print("sum =", "".join(add(a, b).stream()))
print("difference =", "".join(sub(a, b).stream()))
print("product =", "".join(mul(a, b).stream()))
print("quotient =", "".join(div(a, b).stream()))
print("modulo =", "".join(mod(a, b).stream()))
print("power =", "".join(pow(a, e).stream()))
print("a is even?", is_even(a))
print("b is odd?", is_odd(b))
# drop staged artifacts immediately when you are done
free_stream(b)
# reclaim space for files whose age outweighs their use
collect_garbage(0.5)
Each arithmetic call writes its result back into instance/log (configurable via mathstream.number.LOG_DIR) so you can stream the digits later or reuse them in further operations.
Core Concepts
- StreamNumber(path | literal=...) – Wraps a digit text file or creates one for an integer literal inside
LOG_DIR. Literal operands are persisted asliteral_<hash>.txt, so repeated runs reuse the same staged file (note thatclear_logs()removes these cache files too). .stream(chunk_size)– Yields strings of digits with the provided chunk size. Operations inmathstream.engineconsume these streams to avoid loading the entire number at once.- Automatic staging – Outputs are stored under
LOG_DIRwith hashes based on input file paths, letting you compose operations without manual bookkeeping. - Sign-aware – Addition, subtraction, multiplication, division (
//behavior), modulo, and exponentiation (non-negative exponents) all respect operand sign. Division/modulo follow Python’s floor-division rules. - Utilities –
clear_logs()wipes prior staged results so you can start fresh. - Manual freeing – Call
stream.free()(orfree_stream(stream)) once you are done with a staged number to release its reference immediately. Logger metadata keeps per-path reference counts so the final free removes the backing file on the spot. - Parity helpers –
is_evenandis_oddinspect the streamed digits without materializing the integer. - Garbage collection –
collect_garbage(score_threshold)computes a score from file age, access count, and reference count (tracked ininstance/mathstream_logs.sqlite, freshly truncated each run). Files whose score meets or exceeds the threshold are deleted, letting you tune how aggressively to reclaim space. Both staged results and literal caches participate. Usetracked_files()oractive_streams()to inspect current state.
Divide-by-zero scenarios raise the custom DivideByZeroError so callers can distinguish mathstream issues from Python’s native exceptions.
Example Script
test.py in the repository root demonstrates a minimal workflow:
- Writes sample operands to
tests/*.txt. - Calls every arithmetic primitive plus the modulo/parity helpers.
- Asserts that the streamed outputs match known values (helpful for quick regression checks).
Run it via:
python test.py
Extending
- To hook into other storage backends, implement your own
StreamNumbervariant with the same.stream()interface. - Need modulo or gcd? Compose the existing primitives (e.g., repeated subtraction or using
div+ remainder tracking inside_divide_abs) or add new helpers following the same streamed pattern. - For more control over output locations, override
LOG_DIRbefore using the operations:
from mathstream import engine
from pathlib import Path
engine.LOG_DIR = Path("/tmp/my_mathstage")
engine.clear_logs()
With these building blocks, you can manipulate arbitrarily large integers while keeping memory usage constant. Happy streaming!