AlexTheAnalyst · 43.3/100

01 · Roasts

21K Followers, 21 Commits

You have 21,292 followers but made exactly 21 public commits this year — that's one commit per 1,014 fans. Your audience is clearly more productive than you are.

The One-Day Architect

SnowflakeCourse: created April 21, last push April 21. DatabricksIDP: born Feb 3, died Feb 4. You don't build repos, you *deposit* them.

Following: 0

You follow exactly zero people on GitHub. With 21K followers, you've built a megaphone with no ears — 1 PR and 0 issues opened this year confirms you've never once peeked at anyone else's code.

82% Jupyter, 0% Tests

Your entire codebase is essentially a pile of notebooks with no tests, no CI, and no licenses anywhere. It's less a portfolio and more a collection of Ctrl+Shift+Enter sessions.

Data Analyst, No Data on Your Own Work

DatabricksSeries ships users_dirty.csv with dates like '2.29.24' — a data analyst whose own tutorial data fails basic date validation is a bold choice.

Built using

Zoral

Shadows one worker for a week, then takes over their job with zero extra setup. Behaves exactly like the original.

zoral.ai

02 · Category breakdown

Impact
25% weight
43D
Consistency
20% weight
55D
Quality
20% weight
28F
Depth
15% weight
25F
Breadth
10% weight
35F
Community
10% weight
75B

03 · Stats

365-day commit heatmap

10 active days

Less

Language distribution

7 langs

Jupyter Notebook82%
HTML13%
CSS2%
SCSS2%
JavaScript0%
Python0%
Other1%

04 · Numbers

Owned repos

non-fork

Commits

last 12 months

Followers

21,292

Joined GitHub

Jan 2020

05 · Top repos

AlexTheAnalyst /

SnowflakeCourse

25/100

Realistic sample dataset for Snowflake training with 500 users, 9 months of activity (~150k rows across 10 files). Well-documented use cases and intentional data quality issues. No code, no tests, repo is pure data files and documentation.

I15Q45D15

README

Unknown★ 63mo ago

AlexTheAnalyst /

DatabricksSeries

23/100

Minimal data engineering tutorial collection with sample CSVs and notebooks; no documentation, tests, CI, or version control discipline; 5 of last 30 commits suggesting sparse recent activity.

I30Q15D25

Unknown★ 663mo ago

AlexTheAnalyst /

DatabricksIDP

13/100

Minimal experimental project: 64 KB repo with 10 commits over 2 days, no documentation, tests, CI, or license. Appears to be early-stage exploration.

I15Q10D15

Unknown★ 105mo ago

06 · Timeline

Jan 28, 2020
Joined GitHub
Nov 13, 2025
Created DatabricksSeries
Feb 3, 2026
Created DatabricksIDP
Apr 21, 2026
Created SnowflakeCourse
Apr 21, 2026
Most recent push to SnowflakeCourse

07 · Compare

Compare AlexTheAnalyst against

github.com/

AlexTheAnalyst · 6dmedian coder

08 · Rubric

How this score was produced

Overall = Σ (category × weight) + gentle top-end curve

CategoryWeightScoreContrib.

Raw total42.1

Top-end curve+1.2

Final overall43.3

Tier thresholds

S90–100Mass-producing humansA80–89Ship machineB70–79Solid engineerC60–69Getting thereD40–59README enthusiastF0–39GitHub tourist

▸ How the pipeline works

01Scrape.Pull every non-fork repo pushed in the last 90 days, plus your contribution calendar, followers, and language byte counts — straight from GitHub's REST & GraphQL APIs.
02Triage.A small model reads every repo's file tree + README and picks the 20 files per repo that actually reveal how you code.
03Grade each repo. All repos run in parallel through a fast scoring model that reads the picked files and rates each one independently on Impact, Quality, and Depth — with evidence citations.
04Aggregate. A larger reasoning model combines the per-repo scores with server-computed stats (heatmap, commit cadence, language entropy, follower count) to produce the 6-dimension profile score + roasts.
05Correct.Deterministic server-side checks enforce anchor-scale floors (e.g. a profile with 2,000+ public commits can't score 30 Consistency) and recompute the final verdict.

~90 seconds per profile, ~$0.25 in compute. Total of ~240 files read across your top-12 repos. One rating per GitHub account per day.

▸ Data sources & caveats

Heatmap & commit totals: GitHub GraphQL contributionsCollection — covers the last 365 days, includes private repos when the user has opted in (default).
Language %: byte totals across the top 30 owned non-fork repos.
Curve: a small upward nudge centered on raw score ≈ 70, capping at 100. Prevents specialists from being unfairly penalised for narrow breadth.
Anchor corrections: when server-measured signals (e.g. privateWorkLikely, multiRepoVolume, follower count) mandate a minimum category score, the aggregation step enforces it. These are signal-conditional, not identity-based floors.