Scrape Hacker News — Stories, Jobs, Comments & Users
Extract structured data from Y Combinator's Hacker News. Scrape top stories, new stories, job postings, Ask HN threads, Show HN projects, comments, and user profiles. HTTP-only — no Chrome, no Playwright.
Anti-Bot
TLS fingerprinting
What You Can Scrape
- Top stories — title, URL, score, author, comment count
- Job postings — YC startup job listings from the monthly "Who is Hiring" threads
- Comments — full comment threads with nesting and author info
- User profiles — karma, created date, about text
- Ask HN / Show HN — community questions and project showcases
- Search — full-text search across all HN content
Primary Use Cases
- Tech trend monitoring — track what the tech community is discussing
- Startup hiring data — scrape YC startup job postings for market intelligence
- Sentiment analysis — analyze community reactions to product launches or tech news
- Content research — find top-performing content topics for tech blogs
- Competitive intelligence — monitor mentions of your company or competitors
API Example
curl -X POST "https://api.apify.com/v2/acts/santamaria-automations~ycombinator-scraper/runs?token=YOUR_TOKEN" \
-H "Content-Type: application/json" \
-d '{
"mode": "top",
"maxResults": 100,
"includeComments": true
}'
Integrations
- n8n / Make / Zapier — trigger runs and process results via webhook
- AI Agents (MCP) — Claude Desktop, Cursor, VS Code, LangChain, LlamaIndex
- Python / Node.js — Apify SDK for programmatic access
- Google Sheets / Airtable — export via Apify integrations
How It Compares
| Feature | NanoScrape | HN API wrappers |
| Docker image | ~12MB | Varies |
| RAM usage | 128MB | Varies |
| Cost per 1k items | $0.50 | $2 - $10 |
| Output format | Structured JSON | Raw API format |
| Comments included | Yes (nested) | Separate calls |
| Search | Built-in | Algolia API |
Output Fields
| Field | Type | Example |
| title | string | Show HN: I built a new database |
| url | string | https://example.com/article |
| score | number | 342 |
| author | string | pg |
| comment_count | number | 187 |
| created_at | string | 2026-04-23T08:15:00Z |
| type | string | story |
| hn_url | string | https://news.ycombinator.com/item?id=... |