How did GitHub get their first customers?

GitHub got their first customers through Technical preview invitation to tens of thousands of developers. Their most effective growth channel became word-of-mouth.

What tech stack does GitHub use?

GitHub is built with OpenAI Codex, GPT-3, Visual Studio Code, IntelliJ, Vim, and 7 more technologies.

← Back to browse

GitHub

via Lennys Podcast

SaaS product-hunt-launch subscriptionexisting-tool-frustration

See all SaaS companies using product hunt launch →

Growthproduct hunt launch

Time to PMFapproximately 16 months

Pricingsubscription

Built inapproximately 16 months from initial model experimentation to general availability

The Spark

GitHub Copilot's origin story begins with an unexpected incident. OpenAI hammered GitHub's infrastructure with massive clone requests to harvest public code for training large language models. Rather than viewing this as a problem, Ryan Salva and the GitHub team saw an opportunity. Microsoft and OpenAI had been collaborating on large language models, and the team realized that programming languages—Python, JavaScript, Java, C#—are actually "languages" in the AI sense, with constrained semantics that make them excellent candidates for language model training.

The connection to the Arctic Code Vault, GitHub's year-old initiative to preserve public code on silver film in Finland for thousands of years, provided an elegant solution. "We took that same data snapshot and we brought it to our friends over at OpenAI," Salva explains. The question became: what could they do with large language models trained on public code?

Building the First Version

The initial phase was pure research. OpenAI's world-class researchers experimented with the data, tuned thousands of model parameters, and sent back prototypes for GitHub to test. The team experimented with different user experiences—side panels, right-click menus—before landing on inline autocomplete in the editor. The gray, italicized text presentation became the signature UX, subtly indicating the suggestions were ephemeral and optional.

A critical insight emerged: performance mattered enormously. Developers needed suggestions within approximately 200 milliseconds to stay in flow; anything longer felt like an interruption. The team also invested heavily in "prompt crafting"—learning how to structure requests to the model to get genuinely useful responses. This wasn't just about the model; it was about how you used it.

Finding the First Customers

The work began within GitHub Next, the company's R&D team tasked with "moonshots"—second and third horizon projects that might not bear fruit for years. When early results showed promise, the team transitioned from pure research to market testing. They created a technical preview and invited tens of thousands, then hundreds of thousands of developers. The response was overwhelming: "crazy, mind blown emoji tweets and threads on Hacker News," Salva recalls. Developers experienced something genuinely magical—code suggestions that understood context and could complete entire functions.

What Worked (and What Didn't)

Scaling brought unexpected challenges. First, hardware scarcity: Copilot requires rare, specialized GPUs in short global supply. "We've actually earmarked quite a bit of capacity and we're greedy, greedy, greedy for more," Salva says.

Second, community trust. The ethical and legal implications of training on public code required constant dialogue with the developer community. Early versions had no content filtering, but offensive outputs damaged the experience. The team experimented with blocklists before partnering with Azure's Department of Responsible AI to deploy sentiment detection models that understood context—crucial for medical software scenarios where certain words might be appropriate or offensive depending on use.

Third, organizational structure. Moving Copilot from R&D to a productized team required careful knowledge transfer. Researchers had to be replaced in-seat by engineers who'd absorbed their expertise before moving back to GitHub Next. The product team had to own the roadmap, not outsource innovation to researchers. Engineering fundamentals—service operations, monitoring, security, privacy—felt unnatural to researchers but were non-negotiable for a production product.

Interestingly, scaling the *product team* proved more important than scaling engineering. "Every good product manager should spend as much time as possible with customers," Salva notes. Copilot required more product and community management capacity than traditional products because skepticism was healthy and necessary. Concerns about model poisoning, security, training data ethics, and AI's long-term role in software development demanded genuine dialogue, not dismissal.

Where They Are Now

Approximately 16 months from initial experimentation to general availability, Copilot has achieved remarkable adoption. Python developers now write approximately 40% of their code with Copilot assistance (ranging from upper 20s to 40s across languages). The product has scaled from a research curiosity to a core GitHub offering.

Ryan Salva's broader lesson on portfolio management at larger companies: allocate roughly 5-10% of team capacity to bold experimental bets like GitHub Next, 25-30% to operational excellence of existing products, and 60% to incremental improvements on proven winners. This mix ensures innovation without sacrificing reliability or user satisfaction. For Copilot specifically, the vision extends beyond code generation: AI will infuse the entire development stack, from PR summaries to build optimization, freeing developers to focus on creative design and outcome-driven thinking rather than rote memorization and syntax.

Why It Worked

•GitHub leveraged an existing massive dataset of public code combined with breakthrough AI capabilities to solve a genuine pain point developers experienced daily, creating immediate product-market fit rather than searching for use cases.
•The technical preview program with hundreds of thousands of developers provided real-world feedback at scale that revealed critical performance requirements (200ms latency) and ethical concerns, allowing the team to refine both the product and trust mechanisms before general availability.
•Word-of-mouth virality emerged naturally because the product delivered a genuinely novel experience that exceeded developers' expectations, turning users into organic advocates who shared their enthusiasm on social platforms and developer communities.
•The 16-month timeline from experimentation to general availability matched the time needed to not only build the core technology but also discover and solve the non-obvious challenges like GPU scarcity and content filtering that could have derailed a faster launch.

How to Replicate

1.Identify a large, high-quality dataset you have privileged access to that aligns with emerging AI capabilities, then experiment with how that combination could solve a specific, frequent pain point in your target user's workflow.
2.Design and launch a technical preview program that invites a substantial portion of your target audience (not just early adopters), and systematically gather feedback on performance thresholds and trust concerns before scaling.
3.Invest deeply in the implementation details that keep users in flow—measure and optimize for latency targets specific to your use case, and test different UX presentations to signal the nature of AI-generated suggestions to users.
4.Establish feedback loops with your community on ethical and safety implications of your product early, and build content filtering or moderation partnerships into your roadmap rather than treating them as post-launch additions.

Similar Companies

247.ai

$25.0M/mo

247.ai, founded by PV Cannon in 2000, is an AI-powered customer service automation platform serving over 150 enterprise customers with $300M+ in ARR. The company raised only $20M from Sequoia (2003) and bootstrap, achieving 10% net profit margins while maintaining a 12-month CAC payback period and 100% net revenue retention. Despite a security breach setback around 2018, 247.ai has recovered and recently achieved 20% new revenue booking growth in their best quarter.

iCIMS

$13.3M/mo

iCIMS is a bootstrapped SaaS provider founded in 1999 that dominates the talent acquisition software market as the #2 player, serving 3,500 enterprise customers with an average monthly spend of $4,000. The company exited 2017 with $160M ARR and is targeting 25%+ annual growth while maintaining profitability, recently acquiring Text Recruit to expand into candidate messaging and recruitment advertising.

Zoom

$12.0M/mo

Zoom is a freemium SaaS video conferencing platform founded by Eric Yuan in July 2011 after he left Cisco to build a next-generation collaboration solution. The company has grown to 850,000+ paying customers across individual, SMB, and enterprise segments, generating over $12M in monthly recurring revenue with approximately 100% year-over-year growth. Rather than focusing on customer stickiness or aggressive growth targets, Zoom emphasizes customer happiness and organic word-of-mouth acquisition, which has proven highly effective in driving viral adoption.

Madwire

$10.0M/mo

Madwire is a comprehensive SaaS platform for small businesses (1-100 employees) that combines CRM, payments, invoicing, billing, e-commerce, and multi-channel marketing tools in a single platform. Founded in 2009, the company has grown to $120M ARR serving 20,000 customers with an average revenue per user of $500/month, while maintaining strong unit economics ($3,000-$4,000 CAC with 3-month payback) and recently turning profitable with a focus on reaching 15-20% EBITDA margins. The company is exploring an IPO within 12-18 months without having raised substantial capital beyond an initial $7.5M.

SwiftPage

$7.0M/mo

SwiftPage is a CRM and marketing automation platform founded in 2001 that targets small businesses. Under CEO John Oshel's leadership since 2012, the company scaled from 60,000 customers with $26.2M revenue in 2015 to 84,000 customers today with an estimated ARR of $36M+, maintaining 1.5% monthly logo churn and a 6-7 month payback period with a sub-$500 CAC.

Read original source