01 · Roasts
442 repos, 3 stars — collector's edition
You've been on GitHub since 2009 — 16 years, 442 public repos — and the entire portfolio has earned 3 stars. That's one star per 5.3 years of effort. The archive is impressive; the audience is not.
The README said 'requirements. tx'
rl-llm-calibration-test is your strongest repo and its README has a typo in the install instructions. If your best foot forward has a typo, the other 441 repos must be barefoot.
Zero PRs, zero issues, 1651 following
You follow 1,651 people but opened zero external PRs and zero issues this year. That's not community engagement — that's a very curated reading list.
Two repos born and died on the same day
cs-510-computational-imaging and oresat-startracker-calibration-test both have their first and last commit on 2023-02-08. They lived their entire lives in a single afternoon. RIP.
HDL king of nobody's court
Verilog and VHDL together make up 35% of your codebase — more than any single language. Zero stars, zero forks. The synthesis reports compile; the GitHub metrics do not.
Built using
Zoral
Shadows one worker for a week, then takes over their job with zero extra setup. Behaves exactly like the original.
zoral.ai
02 · Category breakdown
- Impact25% weight28F
- Consistency20% weight35F
- Quality20% weight33F
- Depth15% weight35F
- Breadth10% weight65C
- Community10% weight40D
03 · Stats
365-day commit heatmap
135 active days
Language distribution
- TeX38%
- Verilog18%
- VHDL17%
- Jupyter Notebook8%
- HTML8%
- C++6%
- Other5%
04 · Numbers
Owned repos
non-fork
17
Commits
last 12 months
116
Followers
121
Joined GitHub
Apr 2009
05 · Top repos
aakarsh /
rl-llm-calibration-test
Research replication codebase measuring LLM calibration on open datasets (MMLU, LogicQA, TruthfulQA). Typed Python with structured runner modules, has tests and CI, but limited documentation and nascent research focus. Single author, minimal stars/forks.
aakarsh /
cs-510-computational-imaging
Course assignment repository with minimal content (224KB, TeX language). README is nearly empty, no tests/CI, 5 commits over ~3 hours on 2023-02-08. Appears to be a one-off academic submission.
aakarsh /
oresat-startracker-calibration-test
One-off calibration script for OreSat star tracker with minimal docs, no tests, no CI, and only ~28KB of code committed in a single day (4 of 30 commits sampled).
06 · Timeline
- Apr 18, 2009Joined GitHub
- Feb 8, 2023Created cs-510-computational-imaging
- Feb 8, 2023Created oresat-startracker-calibration-test
- Mar 11, 2024Created rl-llm-calibration-test — Attempt at replication of the parts of the paper "Language models (mostly) know what they know", on open datasets, and models.
- Apr 16, 2024Most recent push to rl-llm-calibration-test
07 · Compare
08 · Rubric
How this score was produced
Overall = Σ (category × weight) + gentle top-end curve
Tier thresholds
▸ How the pipeline works
- 01Scrape.Pull every non-fork repo pushed in the last 90 days, plus your contribution calendar, followers, and language byte counts — straight from GitHub's REST & GraphQL APIs.
- 02Triage.A small model reads every repo's file tree + README and picks the 20 files per repo that actually reveal how you code.
- 03Grade each repo. All repos run in parallel through a fast scoring model that reads the picked files and rates each one independently on Impact, Quality, and Depth — with evidence citations.
- 04Aggregate. A larger reasoning model combines the per-repo scores with server-computed stats (heatmap, commit cadence, language entropy, follower count) to produce the 6-dimension profile score + roasts.
- 05Correct.Deterministic server-side checks enforce anchor-scale floors (e.g. a profile with 2,000+ public commits can't score 30 Consistency) and recompute the final verdict.
~90 seconds per profile, ~$0.25 in compute. Total of ~240 files read across your top-12 repos. One rating per GitHub account per day.
▸ Data sources & caveats
- Heatmap & commit totals: GitHub GraphQL
contributionsCollection— covers the last 365 days, includes private repos when the user has opted in (default). - Language %: byte totals across the top 30 owned non-fork repos.
- Curve: a small upward nudge centered on raw score ≈ 70, capping at 100. Prevents specialists from being unfairly penalised for narrow breadth.
- Anchor corrections: when server-measured signals (e.g. privateWorkLikely, multiRepoVolume, follower count) mandate a minimum category score, the aggregation step enforces it. These are signal-conditional, not identity-based floors.