tomalmog · 58.4/100 — Rate My GitHub

01 · Roasts

CI? Only in One Zip Code

Six repos scored, and exactly ONE has CI — tempo. benchmark-mondays has 10+ test suites and still no pipeline. You know how to write tests; you just refuse to run them automatically.

344 KB Engine, 0 Stars, 0 README

benchmark-mondays is your most architecturally impressive repo — Box-Muller price simulation, full poker hand evaluator, multi-arena agent runner — and you didn't even write a README. It's a masterpiece in a locked room.

The 'tempo' Anomaly

20 stars on a Claude Code rate-limiter but zero stars on an ML training platform with 13 methods and GPU support. Your marketing strategy appears to be 'release and pray'.

Python Monolinguist with a TypeScript Side Hustle

79% Python, 14% TypeScript — you're not multi-lingual, you're bilingual at best. GDScript at 1% suggests a game project that never got past 'hello world' in Godot.

Night Owl Who Codes in Bursts

60% night-owl commits, entire zero-weeks scattered across the heatmap. You're not a daily coder — you're a binge coder who disappears for a week then drops 4-level commit days on a Saturday.

Built using

Zoral

Shadows one worker for a week, then takes over their job with zero extra setup. Behaves exactly like the original.

zoral.ai

02 · Category breakdown

Impact
25% weight
48D
Consistency
20% weight
65C
Quality
20% weight
58D
Depth
15% weight
60C
Breadth
10% weight
65C
Community
10% weight
30F

03 · Stats

365-day commit heatmap

148 active days

Less

Language distribution

7 langs

Python79%
TypeScript14%
GDScript1%
JavaScript1%
CSS1%
HTML1%
Other3%

04 · Numbers

Owned repos

non-fork

Commits

last 12 months

699

Followers

Joined GitHub

Oct 2019

05 · Top repos

tomalmog /

crucible

53/100

Comprehensive ML training platform (13 methods, remote GPU, interp tools) with typed Python, structured architecture, and extensive docs. Active development but still nascent (0 stars, ~55 commits in 2 months).

I40Q60D60

READMETests

Python★ 03mo ago

tomalmog /

tempo

52/100

Python CLI automation tool for Claude Code with rate-limit handling and session persistence. Typed, well-documented, structured codebase with CI/CD but no tests. 30 commits across ~3 months shows steady development.

I40Q60D55

READMECI

Python★ 204mo ago

tomalmog /

personal

50/100

Personal portfolio website in TypeScript with Next.js, React, and integrated Groq AI chatbot. Well-structured, typed frontend with functional demo projects and experience showcase, but minimal tests/CI and limited architectural scope for a portfolio project.

I40Q60D50

READMETyped

TypeScript★ 03mo ago

tomalmog /

benchmark-mondays

45/100

Monorepo for multi-arena AI agent competition engine (stock trading + poker). TypeScript codebase, multi-package with test coverage, but zero public adoption, no README, and created within 3 weeks.

I25Q60D50

TestsTyped

TypeScript★ 03mo ago

tomalmog /

slide

38/100

Personal Expo/React Native prediction market app with TypeScript, structured multi-file layout, and live crypto integration. No tests, CI, or license; boilerplate-heavy README. ~40 days old, 11 recent commits.

I25Q55D35

READMETyped

TypeScript★ 05mo ago

tomalmog /

musical

23/100

TypeScript Next.js "Heardle" game cloning Last.fm data into a music-guessing game with YouTube playback. Demonstrates structured React hooks and API integration, but is brand-new (2 commits, 85 KB), lacks tests/CI, and is a tutorial/one-off learning project.

I15Q45D10

READMETyped

TypeScript★ 04mo ago

06 · Timeline

Oct 29, 2019
Joined GitHub
Nov 13, 2025
Created personal — Personal Website
Dec 12, 2025
Created tempo — Automated Claude Code runner with rate limit handling. Run long Claude Code tasks overnight. Tempo automatically detects rate limits, waits for reset, and continues your task with
Jan 10, 2026
Created slide
Feb 10, 2026
Created crucible — LLM training suite
Mar 13, 2026
Created musical
Mar 27, 2026
Created benchmark-mondays
Apr 16, 2026
Most recent push to personal

07 · Compare

Compare tomalmog against

github.com/

tomalmog · 6dmedian coder

08 · Rubric

How this score was produced

Overall = Σ (category × weight) + gentle top-end curve

CategoryWeightScoreContrib.

Raw total55.1

Top-end curve+3.3

Final overall58.4

Tier thresholds

S90–100Mass-producing humansA80–89Ship machineB70–79Solid engineerC60–69Getting thereD40–59README enthusiastF0–39GitHub tourist

▸ How the pipeline works

01Scrape.Pull every non-fork repo pushed in the last 90 days, plus your contribution calendar, followers, and language byte counts — straight from GitHub's REST & GraphQL APIs.
02Triage.A small model reads every repo's file tree + README and picks the 20 files per repo that actually reveal how you code.
03Grade each repo. All repos run in parallel through a fast scoring model that reads the picked files and rates each one independently on Impact, Quality, and Depth — with evidence citations.
04Aggregate. A larger reasoning model combines the per-repo scores with server-computed stats (heatmap, commit cadence, language entropy, follower count) to produce the 6-dimension profile score + roasts.
05Correct.Deterministic server-side checks enforce anchor-scale floors (e.g. a profile with 2,000+ public commits can't score 30 Consistency) and recompute the final verdict.

~90 seconds per profile, ~$0.25 in compute. Total of ~240 files read across your top-12 repos. One rating per GitHub account per day.

▸ Data sources & caveats

Heatmap & commit totals: GitHub GraphQL contributionsCollection — covers the last 365 days, includes private repos when the user has opted in (default).
Language %: byte totals across the top 30 owned non-fork repos.
Curve: a small upward nudge centered on raw score ≈ 70, capping at 100. Prevents specialists from being unfairly penalised for narrow breadth.
Anchor corrections: when server-measured signals (e.g. privateWorkLikely, multiRepoVolume, follower count) mandate a minimum category score, the aggregation step enforces it. These are signal-conditional, not identity-based floors.