Md Kawsar kawsarlog

Hi, I'm Kawsar 👋

Web Scraping & Data Automation Engineer | I build Python pipelines that pull clean data from sites most tools can't touch

🖊️ Love to Write code
📝 Website https://kawsarlog.com/
💬 Ask me about anything, i am happy to help :)

About

I've extracted 15M+ records for clients across real estate, healthcare, and e-commerce, without getting blocked once.

For 9+ years I've built Python scrapers and automation that pull clean, structured data from the platforms most tools choke on: Zillow, Realtor.com, LoopNet, Crexi, Amazon, and more. If it sits behind a login, a CAPTCHA, or a messy private API, I've probably already cracked it.

What I build:

Custom scrapers, Selenium, Playwright, curl_cffi, Apify actors
API reverse-engineering and GraphQL extraction
Async, proxy-rotating pipelines that run in the cloud
Verified B2B & real estate lead lists, plus AI-ready datasets

Most people find me after another scraper broke, got blocked, or buckled the moment they tried to scale it. That's the work I like.

The track record: Fiverr Pro (top 1%) · 200+ projects on Upwork · 1,000+ hours logged. Currently founder of bigiByte, part of the TocoLabs studio network.

Stack

Scraping & automation: Selenium · Playwright · curl_cffi · Scrapy · BeautifulSoup · Apify · aiohttp · rotating proxies Data & backend: Python · Pandas · PostgreSQL · SQLite · MongoDB · FastAPI · Flask · REST · GraphQL Infra: Docker · AWS (Lambda, API Gateway) · Linux

📌 What I work on

Reverse-engineering platform APIs (HouseSigma, LoopNet, Crexi, Zillow) for clean, scalable data access
Apify actors for real estate, e-commerce, and lead-gen
Async pipelines processing 100K+ records per run with proxy rotation and resume capability
LLM-powered extraction and enrichment on top of raw scraped data

Problems I get hired to solve

The situation	What I do
"The site blocks our scraper"	Anti-bot bypass — Cloudflare, CAPTCHA, fingerprinting
"We need 100K rows, not 100"	Async, proxy-rotating pipelines that scale and resume
"The data's behind a private API"	Reverse-engineer it, pull it clean
"We need leads, not raw HTML"	Verified B2B & real estate lists, deduped and structured
"Make it run on its own"	Scheduled cloud jobs on AWS Lambda + API Gateway

GitHub Stats

Connect

Got a scraping job another tool couldn't handle? Let's talk →

Provide feedback

Saved searches

Use saved searches to filter your results more quickly