Skip to content

Web Scraping

Traditional scraping tools (Puppeteer, Playwright, Selenium) are easily detected by anti-bot systems. Veilus solves this by providing:

  • Real browser fingerprints — pass Canvas, WebGL, and AudioContext checks
  • Residential proxy support — rotate IPs per request
  • Profile persistence — maintain cookies across scraping sessions
  • VeilusFlow — record scraping flows visually, no code needed
  1. Create a profile with a proxy
  2. Navigate to the target website
  3. Start VeilusFlow recording
  4. Click on the data you want to extract (price, title, rating)
  5. Right-click each element → Extract Text
  6. Stop recording
  7. Run the flow — data is saved to CSV

For programmatic access, use Veilus’s local API:

// Connect to a running Veilus profile
const response = await fetch('http://localhost:9222/json/version');
const { webSocketDebuggerUrl } = await response.json();
// Use Chrome DevTools Protocol (CDP)
const browser = await puppeteer.connect({
browserWSEndpoint: webSocketDebuggerUrl
});
const page = await browser.newPage();
await page.goto('https://example.com/products');
// Extract data
const products = await page.evaluate(() => {
return [...document.querySelectorAll('.product')].map(el => ({
title: el.querySelector('.title')?.textContent,
price: el.querySelector('.price')?.textContent,
}));
});
Rule: Max 1 request per 3-5 seconds

Don’t scrape faster than a human would browse. Use random delays:

  • Page load: wait 2-5 seconds
  • Between items: wait 1-3 seconds
  • Between pages: wait 5-10 seconds
  • Use a different proxy for each scraping session
  • Rotate IP after every 50-100 pages
  • Use residential proxies for protected sites
  • Create a pool of 5-10 profiles with different fingerprints
  • Rotate between profiles during long scraping sessions
  • Each profile should have its own proxy
  • Save cookies and session data (profiles persist automatically)
  • Reuse the same profile+proxy for subsequent visits to the same site
  • This builds “reputation” with the target site

When a CAPTCHA appears during scraping:

  1. VeilusFlow can pause and wait for manual solving
  2. Auto-detection — VeilusFlow detects common CAPTCHA patterns
  3. Third-party solvers — Integrate 2captcha or anti-captcha via API

VeilusFlow extracted data can be exported as:

  • CSV — For spreadsheets
  • JSON — For programmatic processing
  • Clipboard — Copy-paste individual values