01 · Roasts
The Invisible Shipper
You built a whole personal portfolio site (portfolio repo) to show off your work... and forgot to write a README for it. The website that promotes your projects can't even promote itself.
13-Day Research Paper Drop
LLM-Confidence-Calibration-Benchmark: 8 stars, 4 datasets, 8 models, sophisticated ECE metrics — and exactly 13 days of commit history. Either you're a genius or you just sprinted to the finish line and never looked back.
Heatmap Fade-Out
Your heatmap looks like a campfire dying out — intense bursts in weeks 1–10, then a long quiet winter of almost nothing for weeks 15–45. 134 public commits in a year is 'checking in occasionally,' not shipping.
Issue Zero
totalIssuesYear=0. You opened zero issues across all of GitHub this year. For someone billing themselves as SDE + AI&ML Engineer, that's an impressive level of silence on every open source project you've ever used.
90% Solo Artist
soloPct=90: nine out of ten commits happen in complete isolation. The @flixsrota namespace implies a brand, but a brand with no community engagement is just a logo with nobody watching.
Built using
Zoral
Shadows one worker for a week, then takes over their job with zero extra setup. Behaves exactly like the original.
zoral.ai
02 · Category breakdown
- Impact25% weight48D
- Consistency20% weight55D
- Quality20% weight67C
- Depth15% weight55D
- Breadth10% weight65C
- Community10% weight40D
03 · Stats
365-day commit heatmap
92 active days
Language distribution
- HTML37%
- TypeScript33%
- TeX8%
- Python7%
- Java4%
- Jupyter Notebook3%
- Other8%
04 · Numbers
Owned repos
non-fork
23
Commits
last 12 months
134
Followers
14
Joined GitHub
Apr 2019
05 · Top repos
NIKHIL0VERMA /
flixsrota-player
Typed React Native/Expo YouTube player with functional architecture, CI/CD, and structured codebase. Early-stage indie project with clear API, cross-platform support, and complete build tooling.
NIKHIL0VERMA /
LLM-Confidence-Calibration-Benchmark
Research benchmark evaluating LLM confidence calibration across GSM8K, BoolQ, TruthfulQA, CommonSenseQA using sentence embeddings and ECE/MCE metrics. Typed Python with structured src/ + notebooks, semantic evaluation and multi-model support, but very limited adoption (8 stars, 13 days old).
NIKHIL0VERMA /
portfolio
Personal portfolio built in Next.js with TypeScript, structured components, CI/CD via GitHub Actions, and Tailwind styling. No README; minimal documentation. Active commits across 4 months demonstrate sustained indie project work.
NIKHIL0VERMA /
NIKHIL0VERMA
A GitHub profile README with no code, no tests, no CI, and no license. Pure portfolio scaffolding—46 KB with badges and links only, no actual project implementation or substantive work.
06 · Timeline
- Apr 8, 2019Joined GitHub
- May 24, 2024Created NIKHIL0VERMA — Using the readme file to enhance my GitHub profile
- Sep 7, 2025Created flixsrota-player — A Youtube player for react-native and expo; distraction free(without recommendations, share buttons, or channel overlays). Future versions will integrate with the upcoming Flixsrot
- Dec 21, 2025Created portfolio — I'm a generalist based in India. I build solutions across software, hardware, and automation without tech stack boundaries.
- Mar 9, 2026Created LLM-Confidence-Calibration-Benchmark — To analyze whether modern open-source LLMs are well-calibrated, and how calibration varies across different task types such as reasoning, common sense, binary decision making, and
- Apr 16, 2026Most recent push to NIKHIL0VERMA
07 · Compare
08 · Rubric
How this score was produced
Overall = Σ (category × weight) + gentle top-end curve
Tier thresholds
▸ How the pipeline works
- 01Scrape.Pull every non-fork repo pushed in the last 90 days, plus your contribution calendar, followers, and language byte counts — straight from GitHub's REST & GraphQL APIs.
- 02Triage.A small model reads every repo's file tree + README and picks the 20 files per repo that actually reveal how you code.
- 03Grade each repo. All repos run in parallel through a fast scoring model that reads the picked files and rates each one independently on Impact, Quality, and Depth — with evidence citations.
- 04Aggregate. A larger reasoning model combines the per-repo scores with server-computed stats (heatmap, commit cadence, language entropy, follower count) to produce the 6-dimension profile score + roasts.
- 05Correct.Deterministic server-side checks enforce anchor-scale floors (e.g. a profile with 2,000+ public commits can't score 30 Consistency) and recompute the final verdict.
~90 seconds per profile, ~$0.25 in compute. Total of ~240 files read across your top-12 repos. One rating per GitHub account per day.
▸ Data sources & caveats
- Heatmap & commit totals: GitHub GraphQL
contributionsCollection— covers the last 365 days, includes private repos when the user has opted in (default). - Language %: byte totals across the top 30 owned non-fork repos.
- Curve: a small upward nudge centered on raw score ≈ 70, capping at 100. Prevents specialists from being unfairly penalised for narrow breadth.
- Anchor corrections: when server-measured signals (e.g. privateWorkLikely, multiRepoVolume, follower count) mandate a minimum category score, the aggregation step enforces it. These are signal-conditional, not identity-based floors.