01 · Roasts
21K Followers, 21 Commits
You have 21,292 followers but made exactly 21 public commits this year — that's one commit per 1,014 fans. Your audience is clearly more productive than you are.
The One-Day Architect
SnowflakeCourse: created April 21, last push April 21. DatabricksIDP: born Feb 3, died Feb 4. You don't build repos, you *deposit* them.
Following: 0
You follow exactly zero people on GitHub. With 21K followers, you've built a megaphone with no ears — 1 PR and 0 issues opened this year confirms you've never once peeked at anyone else's code.
82% Jupyter, 0% Tests
Your entire codebase is essentially a pile of notebooks with no tests, no CI, and no licenses anywhere. It's less a portfolio and more a collection of Ctrl+Shift+Enter sessions.
Data Analyst, No Data on Your Own Work
DatabricksSeries ships users_dirty.csv with dates like '2.29.24' — a data analyst whose own tutorial data fails basic date validation is a bold choice.
Built using
Zoral
Shadows one worker for a week, then takes over their job with zero extra setup. Behaves exactly like the original.
zoral.ai
02 · Category breakdown
- Impact25% weight43D
- Consistency20% weight55D
- Quality20% weight28F
- Depth15% weight25F
- Breadth10% weight35F
- Community10% weight75B
03 · Stats
365-day commit heatmap
10 active days
Language distribution
- Jupyter Notebook82%
- HTML13%
- CSS2%
- SCSS2%
- JavaScript0%
- Python0%
- Other1%
04 · Numbers
Owned repos
non-fork
19
Commits
last 12 months
21
Followers
21,292
Joined GitHub
Jan 2020
05 · Top repos
AlexTheAnalyst /
SnowflakeCourse
Realistic sample dataset for Snowflake training with 500 users, 9 months of activity (~150k rows across 10 files). Well-documented use cases and intentional data quality issues. No code, no tests, repo is pure data files and documentation.
AlexTheAnalyst /
DatabricksSeries
Minimal data engineering tutorial collection with sample CSVs and notebooks; no documentation, tests, CI, or version control discipline; 5 of last 30 commits suggesting sparse recent activity.
AlexTheAnalyst /
DatabricksIDP
Minimal experimental project: 64 KB repo with 10 commits over 2 days, no documentation, tests, CI, or license. Appears to be early-stage exploration.
06 · Timeline
- Jan 28, 2020Joined GitHub
- Nov 13, 2025Created DatabricksSeries
- Feb 3, 2026Created DatabricksIDP
- Apr 21, 2026Created SnowflakeCourse
- Apr 21, 2026Most recent push to SnowflakeCourse
07 · Compare
08 · Rubric
How this score was produced
Overall = Σ (category × weight) + gentle top-end curve
Tier thresholds
▸ How the pipeline works
- 01Scrape.Pull every non-fork repo pushed in the last 90 days, plus your contribution calendar, followers, and language byte counts — straight from GitHub's REST & GraphQL APIs.
- 02Triage.A small model reads every repo's file tree + README and picks the 20 files per repo that actually reveal how you code.
- 03Grade each repo. All repos run in parallel through a fast scoring model that reads the picked files and rates each one independently on Impact, Quality, and Depth — with evidence citations.
- 04Aggregate. A larger reasoning model combines the per-repo scores with server-computed stats (heatmap, commit cadence, language entropy, follower count) to produce the 6-dimension profile score + roasts.
- 05Correct.Deterministic server-side checks enforce anchor-scale floors (e.g. a profile with 2,000+ public commits can't score 30 Consistency) and recompute the final verdict.
~90 seconds per profile, ~$0.25 in compute. Total of ~240 files read across your top-12 repos. One rating per GitHub account per day.
▸ Data sources & caveats
- Heatmap & commit totals: GitHub GraphQL
contributionsCollection— covers the last 365 days, includes private repos when the user has opted in (default). - Language %: byte totals across the top 30 owned non-fork repos.
- Curve: a small upward nudge centered on raw score ≈ 70, capping at 100. Prevents specialists from being unfairly penalised for narrow breadth.
- Anchor corrections: when server-measured signals (e.g. privateWorkLikely, multiRepoVolume, follower count) mandate a minimum category score, the aggregation step enforces it. These are signal-conditional, not identity-based floors.