01 · Roasts
73% Graveyard Ratio
Nearly three-quarters of your 28 repos haven't seen a push in 2+ years. Your GitHub profile is less a portfolio and more an archaeological dig site. RIP to whatever PHP project is buried in there.
Zero Tests, Three Repos
Not a single test file across any of your three analyzed repos. You work at Huawei Noah's Ark Lab on research code, but apparently the only thing being tested is your reviewer's patience.
44 Public Commits All Year
44 commits in a year is roughly 1 commit every 8 days. The heatmap shows you went completely dark from April through October. privateWorkLikely=true suggests you're actually doing things — just nowhere anyone can see.
interleave: The 3-Hour Repo
interleave was created and last pushed on the same day (2026-04-21) within a 3-hour window and has 2 commits. That's not a project, that's a git init with ambition.
C++ on Top, No C++ Repos Shipped
At 42% of your code by bytes, C++ is your most-used language, yet none of your visible repos are C++ projects; it's all buried in the 73% of repos that have gone stale. Your top language is basically a ghost.
02 · Category breakdown
- Impact (25% weight): 36, grade F
- Consistency (20% weight): 55, grade D
- Quality (20% weight): 43, grade D
- Depth (15% weight): 40, grade D
- Breadth (10% weight): 65, grade C
- Community (10% weight): 40, grade D
03 · Stats
365-day commit heatmap
72 active days
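The active-day figure can be derived directly from the contribution calendar. A minimal sketch, assuming the heatmap arrives in the shape of GitHub's GraphQL `contributionCalendar` object (weeks containing `contributionDays`, each with a `contributionCount`); the function name and the sample payload are made up for illustration and are not this profile's data.

```python
from typing import Any


def count_active_days(calendar: dict[str, Any]) -> int:
    """Count days with at least one contribution in a calendar payload."""
    return sum(
        1
        for week in calendar["weeks"]
        for day in week["contributionDays"]
        if day["contributionCount"] > 0
    )


# Tiny invented payload mirroring the contributionCalendar structure.
sample_calendar = {
    "weeks": [
        {
            "contributionDays": [
                {"date": "2026-04-20", "contributionCount": 0},
                {"date": "2026-04-21", "contributionCount": 2},
            ]
        },
        {
            "contributionDays": [
                {"date": "2026-04-28", "contributionCount": 1},
            ]
        },
    ]
}

active_days = count_active_days(sample_calendar)
print(active_days)
```

Note that an active day is not the same as a commit: a day counts once no matter how many contributions it holds, and the calendar can include contribution types other than commits.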
Language distribution
- C++: 42%
- PHP: 19%
- C: 18%
- Python: 14%
- Jupyter Notebook: 4%
- CSS: 1%
- Other: 2%
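The rubric on this page lists "language entropy" among the server-computed stats but does not define it. One plausible reading is the Shannon entropy of the byte shares above; the sketch below assumes that definition, and the function name is hypothetical.

```python
import math

# Byte shares (percent) as listed in the language distribution.
language_share = {
    "C++": 42,
    "PHP": 19,
    "C": 18,
    "Python": 14,
    "Jupyter Notebook": 4,
    "CSS": 1,
    "Other": 2,
}


def shannon_entropy_bits(shares: dict[str, float]) -> float:
    """Shannon entropy (in bits) of a distribution given as raw shares."""
    total = sum(shares.values())
    probabilities = [value / total for value in shares.values() if value > 0]
    return -sum(p * math.log2(p) for p in probabilities)


entropy = shannon_entropy_bits(language_share)
max_entropy = math.log2(len(language_share))  # all languages equally used
print(f"{entropy:.2f} bits out of a possible {max_entropy:.2f}")
```

Higher values mean bytes are spread more evenly across languages; a single-language profile would score 0.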
04 · Numbers
- Owned repos (non-fork): 22
- Commits (last 12 months): 44
- Followers: 46
- Joined GitHub: Dec 2013
05 · Top repos
roymiles / diffusion-stitching
Research implementation of diffusion-based language model inference technique with structured evaluation framework. Typed Python codebase with comprehensive documentation but no tests or CI; demonstrates sustained thought through modular evaluation pipeline (generation, PRM scoring, stitching) across 6 benchmarks.
roymiles / roymiles.github.io
Personal portfolio/project showcase website (145 KB static HTML+CSS) with multiple research project pages. Clean structure, responsive design, no README, CI via GitHub Pages workflow, but lacks tests and documentation.
roymiles / interleave
One-day-old benchmark specification with README + ~1.5 MB Python framework scaffold. No tests, CI, license, or functional code—appears to be initial project dump with pre-existing VLMEvalKit framework imported but not integrated.
06 · Timeline
- Dec 23, 2013: Joined GitHub
- Aug 23, 2018: Created roymiles.github.io
- Feb 11, 2026: Created diffusion-stitching (Stitching Noisy Diffusion Thoughts for Better Reasoning)
- Apr 21, 2026: Created interleave
- Apr 28, 2026: Most recent push to diffusion-stitching
07 · Compare
08 · Rubric
How this score was produced
Overall = Σ (category × weight) + gentle top-end curve
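As a worked example of the formula, the sketch below plugs in the category scores and weights from the category breakdown. The weighted sum follows directly from the stated formula; the curve does not, because the page only describes it as a small upward nudge centered near a raw score of 70 and capped at 100. The Gaussian bump, its amplitude, and its width are therefore assumptions chosen purely for illustration.

```python
import math

# (score, weight) pairs taken from the category breakdown.
categories = {
    "Impact": (36, 0.25),
    "Consistency": (55, 0.20),
    "Quality": (43, 0.20),
    "Depth": (40, 0.15),
    "Breadth": (65, 0.10),
    "Community": (40, 0.10),
}


def raw_overall(cats: dict[str, tuple[float, float]]) -> float:
    """Weighted sum of category scores: sum of (category x weight)."""
    return sum(score * weight for score, weight in cats.values())


def apply_curve(raw: float, amplitude: float = 3.0, width: float = 10.0) -> float:
    """Hypothetical curve: a Gaussian bump centered on 70, capped at 100.

    The real curve is not published; amplitude and width are invented.
    """
    bump = amplitude * math.exp(-(((raw - 70.0) / width) ** 2))
    return min(100.0, raw + bump)


raw = raw_overall(categories)
curved = apply_curve(raw)
print(f"raw = {raw:.1f}, curved = {curved:.2f}")
```

With these inputs the raw weighted sum is 45.1. Under the assumed curve, a score that far below 70 barely moves, which matches the description of the curve as a top-end adjustment.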
Tier thresholds
▸ How the pipeline works
- 01 Scrape. Pull every non-fork repo pushed in the last 90 days, plus your contribution calendar, followers, and language byte counts, straight from GitHub's REST & GraphQL APIs.
- 02 Triage. A small model reads every repo's file tree + README and picks the 20 files per repo that actually reveal how you code.
- 03 Grade each repo. All repos run in parallel through a fast scoring model that reads the picked files and rates each one independently on Impact, Quality, and Depth, with evidence citations.
- 04 Aggregate. A larger reasoning model combines the per-repo scores with server-computed stats (heatmap, commit cadence, language entropy, follower count) to produce the 6-dimension profile score + roasts.
- 05 Correct. Deterministic server-side checks enforce anchor-scale floors (e.g. a profile with 2,000+ public commits can't score 30 Consistency) and recompute the final verdict.
~90 seconds per profile, ~$0.25 in compute. Total of ~240 files read across your top-12 repos. One rating per GitHub account per day.
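The parallel grading in step 03 could be structured roughly as follows. This is a sketch of the concurrency pattern only: `grade_repo` is a hypothetical stand-in for the scoring-model call, its scoring rule is a placeholder, and the file lists are invented. Only the 20-files-per-repo budget and the three rated dimensions come from the description above.

```python
import asyncio
from dataclasses import dataclass

FILES_PER_REPO = 20  # triage budget stated in step 02


@dataclass
class RepoScore:
    repo: str
    impact: int
    quality: int
    depth: int


async def grade_repo(repo: str, files: list[str]) -> RepoScore:
    """Placeholder for a scoring-model call; returns dummy scores."""
    picked = files[:FILES_PER_REPO]
    await asyncio.sleep(0)  # stands in for network latency
    dummy = len(picked)  # not a real scoring rule
    return RepoScore(repo, impact=dummy, quality=dummy, depth=dummy)


async def grade_all(picked: dict[str, list[str]]) -> list[RepoScore]:
    """Grade every repo concurrently; results keep the input order."""
    tasks = [grade_repo(repo, files) for repo, files in picked.items()]
    return list(await asyncio.gather(*tasks))


# Invented file lists for illustration.
picked_files = {
    "diffusion-stitching": ["README.md", "eval.py", "stitch.py"],
    "interleave": ["README.md"],
}

scores = asyncio.run(grade_all(picked_files))
for score in scores:
    print(score)
```

Because each repo is graded independently, the wall-clock time of this step is bounded by the slowest single repo rather than the sum of all of them.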
▸ Data sources & caveats
- Heatmap & commit totals: GitHub GraphQL contributionsCollection, which covers the last 365 days and includes private repos when the user has opted in (default).
- Language %: byte totals across the top 30 owned non-fork repos.
- Curve: a small upward nudge centered on raw score ≈ 70, capping at 100. Prevents specialists from being unfairly penalised for narrow breadth.
- Anchor corrections: when server-measured signals (e.g. privateWorkLikely, multiRepoVolume, follower count) mandate a minimum category score, the aggregation step enforces it. These are signal-conditional, not identity-based floors.
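The anchor corrections amount to raising a category score to a minimum whenever a measured signal calls for it. A minimal sketch under stated assumptions: the rule table, the signal key `public_commits`, and every floor value are hypothetical, since the page names example signals but does not publish the actual floors.

```python
from typing import Any, Callable

# (predicate over signals, category, minimum score); all values invented.
FloorRule = tuple[Callable[[dict[str, Any]], bool], str, int]

HYPOTHETICAL_RULES: list[FloorRule] = [
    (lambda s: s.get("public_commits", 0) >= 2000, "Consistency", 50),
    (lambda s: bool(s.get("privateWorkLikely", False)), "Consistency", 40),
]


def apply_floors(
    scores: dict[str, int],
    signals: dict[str, Any],
    rules: list[FloorRule],
) -> dict[str, int]:
    """Return a copy of scores with signal-conditional minimums enforced."""
    corrected = dict(scores)
    for applies, category, floor in rules:
        if applies(signals) and corrected.get(category, 0) < floor:
            corrected[category] = floor
    return corrected


before = {"Consistency": 30, "Impact": 36}
after = apply_floors(before, {"public_commits": 2500}, HYPOTHETICAL_RULES)
print(after)
```

The rules look only at measured signals, never at who the account belongs to, which is the distinction the caveat draws between signal-conditional and identity-based floors. Floors only ever raise a score; a category already above its minimum is left unchanged.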