Crawl & Search

Guide: Crawl Web & Search Images

Overview

You can use the RevidAPI public endpoints to:

  • Crawl website content (returns markdown/content and, depending on mode, embedded images/videos).
  • Search images by keyword (returns a list of images with minimal metadata that you can use for thumbnails or suggestions).

Pricing

5 credits per request

Fixed cost per call, not dependent on the length of the response content.

Endpoints

1. Crawl web pages

  • URL: https://api.revidapi.com/paid/website/crawl
  • Method: POST

2. Search images by keyword

  • URL: https://api.revidapi.com/paid/search/image
  • Method: POST

Requirements

Headers (common)

  • x-api-key: Required

Request Body

1. Crawl web (POST /paid/website/crawl)

Parameter Type Required Description
url string Full URL to crawl (any website / article / landing page).
mode string Crawl mode, either web or article. web: prioritizes returning links (a list of URLs) plus markdown/content; images are not fetched. article: prioritizes returning the main content plus the images/videos embedded in the article.

Example requests (cURL)

mode=web (get links + markdown/content, no images)

curl -s -X POST "https://api.revidapi.com/paid/website/crawl" \
  -H "Content-Type: application/json" \
  -H "x-api-key: YOUR_X_API_KEY" \
  -d '{"url":"https://example.com","mode":"web"}'

mode=article (get main content + images/videos)

curl -s -X POST "https://api.revidapi.com/paid/website/crawl" \
  -H "Content-Type: application/json" \
  -H "x-api-key: YOUR_X_API_KEY" \
  -d '{"url":"https://example.com/article","mode":"article"}'
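The same crawl call can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the function names build_crawl_request and crawl and the 30-second timeout are assumptions made for illustration; only the URL, headers, and body fields come from the examples above.

```python
import json
import urllib.request

CRAWL_URL = "https://api.revidapi.com/paid/website/crawl"


def build_crawl_request(api_key: str, url: str, mode: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for the crawl endpoint."""
    # The documentation describes exactly two modes.
    if mode not in ("web", "article"):
        raise ValueError("mode must be 'web' or 'article'")
    body = json.dumps({"url": url, "mode": mode}).encode("utf-8")
    return urllib.request.Request(
        CRAWL_URL,
        data=body,
        method="POST",
        headers={"Content-Type": "application/json", "x-api-key": api_key},
    )


def crawl(api_key: str, url: str, mode: str, timeout: float = 30.0) -> dict:
    """Send the request and decode the JSON response body.

    The timeout value is an assumption, not something the API documents.
    """
    request = build_crawl_request(api_key, url, mode)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Keeping request construction separate from sending makes the payload easy to inspect or unit-test without spending credits.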

2. Search images by keyword (POST /paid/search/image)

Parameter Type Required Description
keyword string Image search keyword (English and Vietnamese are both supported).
max_results int Maximum number of images to return (1-100). Default: 20.

Example request (cURL)

curl -s -X POST "https://api.revidapi.com/paid/search/image" \
  -H "Content-Type: application/json" \
  -H "x-api-key: YOUR_X_API_KEY" \
  -d '{"keyword":"sunset beach","max_results":20}'
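Because every call costs credits, it can be worth validating the parameters locally before sending. The sketch below assumes hypothetical helper names (build_image_search_request, search_images) and a 30-second timeout; the 1-100 range and the default of 20 are taken from the parameter table above.

```python
import json
import urllib.request

IMAGE_SEARCH_URL = "https://api.revidapi.com/paid/search/image"


def build_image_search_request(
    api_key: str, keyword: str, max_results: int = 20
) -> urllib.request.Request:
    """Validate the parameters and build the POST request without sending it."""
    if not keyword.strip():
        raise ValueError("keyword must not be empty")
    # Documented range for max_results is 1-100.
    if not 1 <= max_results <= 100:
        raise ValueError("max_results must be between 1 and 100")
    body = json.dumps({"keyword": keyword, "max_results": max_results}).encode("utf-8")
    return urllib.request.Request(
        IMAGE_SEARCH_URL,
        data=body,
        method="POST",
        headers={"Content-Type": "application/json", "x-api-key": api_key},
    )


def search_images(
    api_key: str, keyword: str, max_results: int = 20, timeout: float = 30.0
) -> dict:
    """Send the search request and decode the JSON response (timeout is an assumption)."""
    request = build_image_search_request(api_key, keyword, max_results)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```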

Response

1. Crawl web

The response JSON looks like:

{
  "success": true,
  "url": "https://example.com",
  "title": "Article title",
  "markdown": "...",
  "content": "...",
  "images": [{"url":"https://.../image.jpg","alt":"..."}],
  "videos": [{"url":"https://...","type":"embed"}],
  "links": ["https://...","https://..."]
}

Notes by mode:

  • mode=web: returns links; images are never returned in this mode.
  • mode=article: returns markdown/content, images (those inside the article, before the "Related news" section), and videos (the embedded video URL, if available); links is always returned as an empty list ([]).
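Since the two modes fill different fields, a small normalizing step keeps downstream code simple. The following is a sketch under assumptions: the helper name summarize_crawl is hypothetical, and treating a missing or false "success" flag as a failure is a guess, because the shape of a failed response body is not documented here.

```python
def summarize_crawl(response: dict) -> dict:
    """Flatten a crawl response into plain lists, tolerating absent fields."""
    # Assumption: a failed crawl carries "success": false (or omits the flag).
    if not response.get("success"):
        raise RuntimeError("crawl did not succeed")
    return {
        "title": response.get("title", ""),
        # Prefer markdown, fall back to the plain content field.
        "text": response.get("markdown") or response.get("content") or "",
        # Empty in mode=web, which does not return images.
        "image_urls": [i["url"] for i in response.get("images", []) if i.get("url")],
        "video_urls": [v["url"] for v in response.get("videos", []) if v.get("url")],
        # Always an empty list in mode=article.
        "links": list(response.get("links", [])),
    }
```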

2. Search images by keyword

Minimal JSON response:

{
  "success": true,
  "keyword": "sunset beach",
  "images": [
    {
      "url": "https://.../image.jpg",
      "title": "Image title",
      "source": "https://source-site.com/article",
      "thumbnail": "https://.../thumb.jpg"
    }
  ],
  "count": 20
}
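To turn this response into thumbnail suggestions, one possible approach is sketched below. The helper name to_suggestions and the output keys (preview, full, caption, source) are hypothetical; falling back to the full image URL when thumbnail is missing is an assumption, since the documentation does not say whether thumbnail is always present.

```python
def to_suggestions(response: dict) -> list:
    """Map an image search response to a list of thumbnail suggestions."""
    suggestions = []
    for image in response.get("images", []):
        url = image.get("url")
        if not url:
            # Skip entries that cannot be displayed at all.
            continue
        suggestions.append(
            {
                # Assumption: use the full image when no thumbnail is given.
                "preview": image.get("thumbnail") or url,
                "full": url,
                "caption": image.get("title", ""),
                "source": image.get("source", ""),
            }
        )
    return suggestions
```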

Error Responses

Status Description
400 Invalid request (missing/wrong parameter format).
401 Invalid x-api-key.
429 Overloaded / queue limit exceeded (depends on system configuration).
502 Cannot fetch content/images from the source (timeout, blocked access, or fetch errors).
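A client might handle these statuses along the following lines. This is a sketch of one possible policy, not documented behavior: retrying on 429 and 502 with exponential backoff is an assumption, as are the helper name call_with_retry, the attempt count, and the delays. It expects a zero-argument callable that raises urllib.error.HTTPError on a non-2xx status, as urllib.request.urlopen does.

```python
import time
import urllib.error

# Assumption: overload (429) and upstream fetch failures (502) may be transient;
# 400 and 401 indicate a problem with the request itself and are not retried.
RETRYABLE_STATUSES = {429, 502}


def call_with_retry(send, max_attempts: int = 3, base_delay: float = 1.0, sleep=time.sleep):
    """Call send(), retrying retryable HTTP errors with exponential backoff."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(1, max_attempts + 1):
        try:
            return send()
        except urllib.error.HTTPError as error:
            # Give up on non-retryable statuses or once attempts are exhausted.
            if error.code not in RETRYABLE_STATUSES or attempt == max_attempts:
                raise
            # Delay doubles each time: base_delay, 2 * base_delay, ...
            sleep(base_delay * 2 ** (attempt - 1))
```

Note that this sketch does not say whether a failed request is billed; check your credit balance if retries matter for cost.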

Suggested Workflow

  1. Call paid/website/crawl with mode=web to get a list of links.
  2. For each link, call paid/website/crawl with mode=article to get the main content + embedded images/videos.
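The two steps above can be sketched as a small loop. Here crawl_fn is a hypothetical callable taking (url, mode) and returning the decoded response; the name crawl_site and the max_articles cap are assumptions, the latter added to bound credit usage. The estimate uses the documented flat rate of 5 credits per request.

```python
CREDITS_PER_REQUEST = 5  # flat rate from the Pricing section


def crawl_site(start_url: str, crawl_fn, max_articles: int = 10):
    """Crawl a listing page in web mode, then each linked page in article mode.

    Returns (articles, estimated_credits).
    """
    listing = crawl_fn(start_url, "web")
    # Drop duplicate links while keeping their original order, then cap the count.
    links = list(dict.fromkeys(listing.get("links", [])))[:max_articles]
    articles = [crawl_fn(link, "article") for link in links]
    # One request for the listing plus one per article.
    estimated_credits = (1 + len(links)) * CREDITS_PER_REQUEST
    return articles, estimated_credits
```

With this structure, a listing page with 10 unique links costs an estimated 11 requests, or 55 credits.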