gstack
garrytan/gstack β 23+ opinionated skills for Claude Code from Garry Tan (CEO of Y Combinator). Turns Claude Code into a virtual team: a CEO who thinks the problem through, an eng manager who freezes the architecture, a designer who catches AI slop, a reviewer hunting production bugs, a QA opening a real browser, a security officer (OWASP+STRIDE), a release engineer opening the PR. Everything as slash commands in Markdown, MIT licensed.
Authorβs reference point: in 60 days he shipped 600k+ lines of production code (35% tests), 10-20k lines per day part-time, while writing for YC full-time.
π Links
Description
- Repo: https://github.com/garrytan/gstack
- License: MIT
- Requirements: Claude Code, Git, Bun β₯1.0, Node.js (Windows)
Download or use
# Paste the prompt into Claude Code (installs globally + appends a section to CLAUDE.md):
# "Install gstack: run git clone --single-branch --depth 1
# https://github.com/garrytan/gstack.git ~/.claude/skills/gstack
# && cd ~/.claude/skills/gstack && ./setup ..."Also works on Codex / Gemini CLI / Cursor / Factory Droid via --host codex|auto|factory.
ποΈ Description
π§© 7-step sprint (Think β Plan β Build β Review β Test β Ship β Reflect)
Key idea: the skills feed each other. /office-hours writes a design doc β /plan-ceo-review reads it β /plan-eng-review produces a test plan β /qa runs it β /review catches bugs β /ship verifies the fix.
| Skill | Role | What it does |
|---|---|---|
/office-hours | YC Office Hours | 6 forcing questions before code; pushback on framing, alternatives, design doc for downstream |
/plan-ceo-review | CEO/Founder | Find the 10-star product within the request; 4 modes (Expansion/Selective/Hold/Reduction) |
/plan-eng-review | Eng Manager | Architecture, data flow, ASCII diagrams, edge cases, test matrix |
/plan-design-review | Senior Designer | 0-10 score per design dimension; AI Slop detection; AskUserQuestion per choice |
/design-consultation | Design Partner | Full design system from scratch (research + creative risks + mockups) |
/design-shotgun | Design Explorer | Many variants, comparison board in the browser, taste memory |
/design-html | Design Engineer | Production HTML with Pretext (text reflow, dynamic heights), framework detection |
/review | Staff Engineer | Bugs that will pass CI and blow up in prod; auto-fix + completeness gaps |
/investigate | Debugger | Iron Law: no fixes without investigation; 3-fail stop rule |
/cso | Chief Security Officer | OWASP Top 10 + STRIDE; 17 false-positive exclusions, 8/10+ confidence gate |
/qa | QA Lead | Real browser, clicks the flows, fixes them, regression test per fix |
/ship | Release Engineer | Sync main + tests + coverage audit + push + PR; bootstraps a test framework if missing |
/land-and-deploy | Release Engineer | Merge β wait for CI β deploy β verify production health |
/canary | SRE | Post-deploy monitoring (console errors, perf, page failures) |
/codex | Second Opinion | Independent review with OpenAI Codex CLI; 3 modes (review/adversarial/consultation) |
/retro | Eng Manager | Weekly retro per-person; /retro global cross-projects + cross-AI (CC/Codex/Gemini) |
/learn | Memory | Cross-session learnings: review/search/prune/export project patterns |
π§© Power tools
/careful (warning before rm -rf/DROP TABLE/force-push), /freeze (lock edits to a single directory), /guard (= careful + freeze), /connect-chrome (Side Panel extension, watch live), /setup-deploy, /gstack-upgrade.
π§© Real browser mode
$B connect launches your Chrome as a headed window controlled by Playwright β you see every click the agent makes live. The side panel = chat sidebar for directly steering Claude. $B handoff for CAPTCHA/MFA: open a visible Chrome with cookies, solve it, $B resume.
βοΈ Reasoning for
The strongest pieces for me: /qa with a real browser + /codex as a cross-model second opinion + /retro global. These are exactly the spots in my workflow on Qamera AI / PLSoft where I lose the most time on manual checking. /document-release also sounds like something I need for Brain (auto-updating README/CLAUDE.md/CONTRIBUTING).
Risks and limitations:
- 23+ skills = a large surface to learn. Plan: start with
/office-hours+/review+/ship+/qa, the rest as needed. - Opinionated stack β some bits (Bun-only, Pretext) may clash with existing conventions in a repo.
- Lots of overlap with Superpowers (TDD/brainstorm/plan/review) β I will need to pick one methodology, not both.
Alternatives considered
- Superpowers β more methodology-first (TDD, subagent-driven), gstack is more rolling-team
- Karpathy Skills β minimal 1-CLAUDE.md answer, gstack is maximalism in the other direction
- Archon β YAML workflow engine + worktree isolation; gstack stays in native Claude Code skills
- Awesome Claude Code β curated list, gstack is a ready-made stack
π Resources
- Karpathyβs note (βI havenβt written a line of code since Decemberβ) β No Priors podcast, March 2026
- Peter Steinberger / OpenClaw (247K stars, solo with agents) as inspiration
- Skills deep-dive: https://github.com/garrytan/gstack/blob/main/docs/skills.md
- Claude Code β primary host
- Agent Skills β SKILL.md standard
- Awesome Claude Code β curated list of resources
- Superpowers β alternative approach
Template: tool