Wikipedia Scraper: Articles, Summaries, and Metadata, $0.50 per 1,000 Articles

Extract Wikipedia articles by title, URL, or Wikidata ID via the official MediaWiki API. Returns the article title, summary (first paragraph), full plaintext content, thumbnail image, lead image, all categories, geographic coordinates when applicable, the Wikidata Q-ID, pageviews over the last 30 days, and references to other languages. Supports 12+ languages including English, German, French, Spanish, Chinese, and Japanese. Ideal for AI training data, knowledge graph enrichment, RAG ingestion, and reference databases.

Open on Apify → Try it now
Pricing
$0.0005/article
RAM
128MB
Coverage
12+ languages
Output fields
20+
Proxy
Apify datacenter
Tech
MediaWiki API

What you get

Primary use cases

API example

# Start a run via the Apify API
curl -X POST "https://api.apify.com/v2/acts/santamaria-automations~wikipedia-scraper/runs?token=YOUR_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "queries": [
      "Stripe (company)",
      "OpenAI",
      "Eiffel Tower",
      "Albert Einstein",
      "Transformer (machine learning model)"
    ],
    "language": "en",
    "extractFullContent": true,
    "includePageviews": true
  }'

# Or use with AI agents via MCP:
# https://mcp.apify.com?tools=santamaria-automations/wikipedia-scraper

Integrations

Output fields

FieldTypeExample
titlestringEiffel Tower
page_idinteger9202
wikidata_idstringQ243
summarystringThe Eiffel Tower is a wrought-iron lattice tower...
contentstringThe Eiffel Tower (French: La Tour Eiffel)...
thumbnail_urlstringhttps://upload.wikimedia.org/wikipedia/commons/...
categoriesarray["Eiffel Tower","Towers in Paris"]
coordinatesobject{"lat":48.8584,"lon":2.2945}
pageviews_30dinteger512,847
canonical_urlstringhttps://en.wikipedia.org/wiki/Eiffel_Tower

Related Actors

Open on Apify → Try it now (free tier available)