aakarsh · 36.9/100 — Rate My GitHub

01 · Roasts

442 repos, 3 stars — collector's edition

You've been on GitHub since 2009 — 16 years, 442 public repos — and the entire portfolio has earned 3 stars. That's one star per 5.3 years of effort. The archive is impressive; the audience is not.

The README said 'requirements. tx'

rl-llm-calibration-test is your strongest repo and its README has a typo in the install instructions. If your best foot forward has a typo, the other 441 repos must be barefoot.

Zero PRs, zero issues, 1651 following

You follow 1,651 people but opened zero external PRs and zero issues this year. That's not community engagement — that's a very curated reading list.

Two repos born and died on the same day

cs-510-computational-imaging and oresat-startracker-calibration-test both have their first and last commit on 2023-02-08. They lived their entire lives in a single afternoon. RIP.

HDL king of nobody's court

Verilog and VHDL together make up 35% of your codebase — more than any single language. Zero stars, zero forks. The synthesis reports compile; the GitHub metrics do not.

Built using

Zoral

Shadows one worker for a week, then takes over their job with zero extra setup. Behaves exactly like the original.

zoral.ai

02 · Category breakdown

Impact
25% weight
28F
Consistency
20% weight
35F
Quality
20% weight
33F
Depth
15% weight
35F
Breadth
10% weight
65C
Community
10% weight
40D

03 · Stats

365-day commit heatmap

135 active days

Less

Language distribution

7 langs

TeX38%
Verilog18%
VHDL17%
Jupyter Notebook8%
HTML8%
C++6%
Other5%

04 · Numbers

Owned repos

non-fork

Commits

last 12 months

116

Followers

121

Joined GitHub

Apr 2009

05 · Top repos

aakarsh /

rl-llm-calibration-test

35/100

Research replication codebase measuring LLM calibration on open datasets (MMLU, LogicQA, TruthfulQA). Typed Python with structured runner modules, has tests and CI, but limited documentation and nascent research focus. Single author, minimal stars/forks.

I25Q45D35

READMETestsCI

Jupyter Notebook★ 12y ago

aakarsh /

cs-510-computational-imaging

15/100

Course assignment repository with minimal content (224KB, TeX language). README is nearly empty, no tests/CI, 5 commits over ~3 hours on 2023-02-08. Appears to be a one-off academic submission.

I15Q25D5

README

TeX★ 13y ago

aakarsh /

oresat-startracker-calibration-test

15/100

One-off calibration script for OreSat star tracker with minimal docs, no tests, no CI, and only ~28KB of code committed in a single day (4 of 30 commits sampled).

I15Q25D5

Python★ 13y ago

06 · Timeline

Apr 18, 2009
Joined GitHub
Feb 8, 2023
Created cs-510-computational-imaging
Feb 8, 2023
Created oresat-startracker-calibration-test
Mar 11, 2024
Created rl-llm-calibration-test — Attempt at replication of the parts of the paper "Language models (mostly) know what they know", on open datasets, and models.
Apr 16, 2024
Most recent push to rl-llm-calibration-test

07 · Compare

Compare aakarsh against

github.com/

aakarsh · 6dmedian coder

08 · Rubric

How this score was produced

Overall = Σ (category × weight) + gentle top-end curve

CategoryWeightScoreContrib.

Raw total36.4

Top-end curve+0.5

Final overall36.9

Tier thresholds

S90–100Mass-producing humansA80–89Ship machineB70–79Solid engineerC60–69Getting thereD40–59README enthusiastF0–39GitHub tourist

▸ How the pipeline works

01Scrape.Pull every non-fork repo pushed in the last 90 days, plus your contribution calendar, followers, and language byte counts — straight from GitHub's REST & GraphQL APIs.
02Triage.A small model reads every repo's file tree + README and picks the 20 files per repo that actually reveal how you code.
03Grade each repo. All repos run in parallel through a fast scoring model that reads the picked files and rates each one independently on Impact, Quality, and Depth — with evidence citations.
04Aggregate. A larger reasoning model combines the per-repo scores with server-computed stats (heatmap, commit cadence, language entropy, follower count) to produce the 6-dimension profile score + roasts.
05Correct.Deterministic server-side checks enforce anchor-scale floors (e.g. a profile with 2,000+ public commits can't score 30 Consistency) and recompute the final verdict.

~90 seconds per profile, ~$0.25 in compute. Total of ~240 files read across your top-12 repos. One rating per GitHub account per day.

▸ Data sources & caveats

Heatmap & commit totals: GitHub GraphQL contributionsCollection — covers the last 365 days, includes private repos when the user has opted in (default).
Language %: byte totals across the top 30 owned non-fork repos.
Curve: a small upward nudge centered on raw score ≈ 70, capping at 100. Prevents specialists from being unfairly penalised for narrow breadth.
Anchor corrections: when server-measured signals (e.g. privateWorkLikely, multiRepoVolume, follower count) mandate a minimum category score, the aggregation step enforces it. These are signal-conditional, not identity-based floors.