Skip to content
View promisingcoder's full-sized avatar

Block or report promisingcoder

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
promisingcoder/README.md

Youssef Nagy Banner

Typing SVG

Data Crawler & Automation Engineer building production scraping systems and open-source extraction tools.
From closed APIs β†’ reverse-engineered protocols β†’ pip-installable packages β†’ data at scale.

PyPI GitHub Website

Repos LinkedIn Email


🧭 What I do

Area What you can expect
API Reverse Engineering Decode private protocols (protobuf, GraphQL, internal REST) β†’ direct HTTP extraction, no browser needed, 50x faster
Anti-Bot Evasion Bypass Cloudflare, Shape Security, Incapsula, DataDome using anti-detect browsers, TLS fingerprinting & ISP proxies
Scalable Data Pipelines Async scraping at 100K+ records/week β€’ proxy rotation β€’ checkpoint resumption β€’ structured output
Open-Source Tooling Production-ready pip packages with full docs, streaming APIs, event systems & CLI interfaces
Government Data Collection 23 US states automated β€” business registrations & professional licenses from gov portals

🌟 Featured

πŸ”“ Reverse-Engineered API Tools

Project What it does Link
GoogleMapsCollector Reverse-engineers Google Maps' internal protobuf API β€” 100K+ records/week, no browser, no API key Repo Β· pip install gmaps-extractor
MetaAdsCollector Reverse-engineers Meta's private GraphQL API β€” full Ad Library extraction across all countries Repo Β· pip install meta-ads-collector
google-maps-pb-decoder Protobuf decoder for Google Maps' binary wire format β€” research & extraction toolkit Repo

πŸ€– Intelligent Scrapers

Project What it does Link
generic-scraper-1 LLM-powered structured extraction from any website β€” define fields, get data, no selectors needed Repo Β· pip install scraper
linkedin-profile-extractor LinkedIn profile extraction with anti-detection β€” experience, education, skills, full profiles Repo
google-maps-scraper Google Maps scraping via browser automation with stealth mode Repo

πŸ›οΈ Government & Public Data

Project What it does Link
gov_websites_collector Collects business registrations & professional licenses from 23 US state government websites β€” Camoufox anti-detect + ISP proxies Repo

πŸ› οΈ Tech stack

Category Tools
Languages & Core Python JavaScript Bash Git Linux
Scraping & Automation Playwright Selenium Scrapy aiohttp Camoufox
Reverse Engineering Protobuf GraphQL TLS/SSL HTTP/2
Backend & APIs FastAPI Node.js React
Databases PostgreSQL MongoDB Redis
Infrastructure Docker Proxy Rotation ISP Proxies CAPTCHA Bypass

🀝 Collaboration

  • πŸ”“ API Reverse Engineering: Decode closed/private APIs (protobuf, GraphQL, internal REST) for direct, fast data extraction
  • πŸ›‘οΈ Anti-Bot Evasion: Defeat Cloudflare, Imperva, Shape Security, DataDome β€” TLS fingerprinting, anti-detect browsers, ISP proxies
  • ⚑ Scalable Scraping Systems: Async pipelines, proxy rotation, checkpoint resumption β€” 100K+ records/week on a single machine
  • πŸ“¦ Open-Source Tooling: Production-ready pip packages with streaming APIs, event systems, and full documentation

Best contact: LinkedIn

If you find the work useful, a ⭐ helps more people discover it.

Footer Banner

Popular repositories Loading

  1. MetaAdsCollector MetaAdsCollector Public

    Collect ads from Meta Ad Library. No API key required. Reverse-engineers the internal GraphQL API. All countries, all ad types.

    Python 29 8

  2. GoogleMapsCollector GoogleMapsCollector Public

    Extract businesses from Google Maps at scale. No browser needed. Reverse-engineers the internal API. 100K+ records/week.

    Python 6 2

  3. linkedin-profile-extractor linkedin-profile-extractor Public

    Extract LinkedIn profile data programmatically. Selenium-based scraper with anti-detection.

    Python 3

  4. promisingcoder promisingcoder Public

    My profile README

    1

  5. StableLM StableLM Public

    Forked from Stability-AI/StableLM

    StableLM: Stability AI Language Models

    Jupyter Notebook

  6. textgenrnn textgenrnn Public

    Forked from minimaxir/textgenrnn

    Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

    Python