Extract Wikipedia articles by title, URL, or Wikidata ID via the official MediaWiki API. Returns the article title, summary (first paragraph), full plaintext content, thumbnail image, lead image, all categories, geographic coordinates when applicable, the Wikidata Q-ID, pageviews over the last 30 days, and references to other languages. Supports 12+ languages including English, German, French, Spanish, Chinese, and Japanese. Ideal for AI training data, knowledge graph enrichment, RAG ingestion, and reference databases.
# Start a run via the Apify API curl -X POST "https://api.apify.com/v2/acts/santamaria-automations~wikipedia-scraper/runs?token=YOUR_TOKEN" \ -H "Content-Type: application/json" \ -d '{ "queries": [ "Stripe (company)", "OpenAI", "Eiffel Tower", "Albert Einstein", "Transformer (machine learning model)" ], "language": "en", "extractFullContent": true, "includePageviews": true }' # Or use with AI agents via MCP: # https://mcp.apify.com?tools=santamaria-automations/wikipedia-scraper
| Field | Type | Example |
|---|---|---|
| title | string | Eiffel Tower |
| page_id | integer | 9202 |
| wikidata_id | string | Q243 |
| summary | string | The Eiffel Tower is a wrought-iron lattice tower... |
| content | string | The Eiffel Tower (French: La Tour Eiffel)... |
| thumbnail_url | string | https://upload.wikimedia.org/wikipedia/commons/... |
| categories | array | ["Eiffel Tower","Towers in Paris"] |
| coordinates | object | {"lat":48.8584,"lon":2.2945} |
| pageviews_30d | integer | 512,847 |
| canonical_url | string | https://en.wikipedia.org/wiki/Eiffel_Tower |