BitterMill - See what a Mac can really run

BitterMill lets you use serious open models on real Apple-silicon machines before you buy hardware, move a workload local, or trust a benchmark table.

Choose a machine. Load a model. Decide from experience.

Benchmarks tell you speed. BitterMill tells you whether you would actually want to use the model.

Named machines

Try models on real Apple-silicon classes instead of abstract benchmark rows.

Live behavior

See cold starts, warm starts, queueing, and responsiveness instead of just a tokens-per-second claim.

Clear economics

Pay separately for model load, warm hold, and generation so the tradeoffs stay legible.

From curiosity to conviction in one session.

Run it live

Open the model on the machine you actually care about.

Use it directly instead of inferring quality from other people’s charts.

Compare classes

See what really changes when memory goes up.

Run the same model across different machines and feel the difference in load, headroom, and rhythm.

Keep it warm

Reserve the experience you want.

Warm sessions, cold starts, and priority all feel different. BitterMill treats them differently.

Choose the class of Mac you want to understand.

Available now

M4 Max Mac Studio

64 GB unified memory

The practical desktop test: how far can a serious Mac go before you need more memory or a different class of machine?

Available now

M5 MacBook

128 GB unified memory

The high-headroom test: best for larger open models, more ambitious workloads, and finding out what happens when memory stops being the first limit.

Coming next

More Apple-silicon classes

The fleet expands over time, so you can test more of the Apple-silicon range without owning every machine yourself.

Pay for the part you actually use.

BitterMill separates setup cost from runtime cost so the economics stay legible.

Load

Spin up the model you want on the machine you chose.

Warm hold

Keep a useful model resident when low latency matters.

Generation

Pay for the actual run, not for folklore around the infrastructure.

Priority

Move ahead in the queue when you need guaranteed access to scarce machine memory.

Request early access.

Tell us what you want to run and what Mac question you need answered.

Access is approved manually while the fleet grows.