Google News Scraper: Articles, Sources and Publish Dates
Scrape news articles from Google News for any search query. Returns clean structured data: article title (without source suffix), publisher name, publisher URL, direct article URL, publish date in RFC 2822 format, and the search query that matched the article. Filter by country and language. HTTP-only Go binary, no browser.
Coverage
Global, 100+ countries
What you can extract
- Article title: cleaned, without the source suffix that Google News appends
- Publisher: source name (Reuters, CNN, TechCrunch, etc.) and publisher_url
- Article URL: direct link to the published article (not the Google News redirect)
- Publish date: RFC 2822 format, ready for parsing into any datetime library
- Search query: the query that matched, useful when running multi-query batches
- Filters: language code (en, de, fr, etc.) and country code for regional results
Primary use cases
- Media monitoring. Track brand mentions, competitor news and industry developments across publishers
- Sentiment analysis. Feed clean headlines into NLP pipelines for market sentiment scoring
- Trend detection. Identify emerging topics before they go mainstream by tracking weekly article counts
- Content aggregation. Build news feeds, executive digests or topic dashboards
- Academic research. Analyze media coverage patterns and news framing across countries
- SEO and AEO research. See which sources rank for your target topics in Google's news ecosystem
API example
curl -X POST "https://api.apify.com/v2/acts/santamaria-automations~google-news-scraper/runs?token=YOUR_TOKEN" \
-H "Content-Type: application/json" \
-d '{
"searchQueries": ["artificial intelligence", "climate change"],
"language": "en",
"country": "US",
"maxResults": 100,
"maxResultsPerQuery": 50
}'
Integrations
- n8n, Make, Zapier: trigger searches and route articles to Slack, email or BI
- AI Agents (MCP): Claude Desktop, Cursor, VS Code, LangChain, LlamaIndex
- Python, Node.js: Apify SDK for programmatic access
- Google Sheets, Airtable, BigQuery: query list in, article records out
Output sample
{
"title": "OpenAI Announces New GPT Model with Multimodal Capabilities",
"source": "Reuters",
"publisher_url": "https://www.reuters.com",
"url": "https://www.reuters.com/technology/openai-announces-new-gpt-model-...",
"published_at": "Thu, 12 Jun 2026 14:30:00 +0000",
"search_query": "artificial intelligence"
}
Related Actors