Code for the paper "Match, Compare, or Select? An Investigation of Large Language Models for Entity Matching" (COLING 2025)
Updated May 27, 2026 - Python
The official implementation of "Quality over Quantity: Boosting Data Efficiency Through Ensembled Multimodal Data Curation" (AAAI 2025).
Our project for the "Data Intelligence Applications" exam at Politecnico di Milano, covering social influence and pricing techniques applied to networks.
Decision intelligence platform for industrial manufacturing. Connects to CRM, ERP, and CMMS systems and monitors industry and macroeconomic conditions to compute leading indicators, generate predictions, and deliver daily executive briefings.
Patent intelligence for AI agents — patent search, USPTO data, patent landscape & pgvector prior-art search. MCP + x402.
Weather & climate intelligence for AI agents — current weather, forecast, historical, climate normals, alerts, agricultural & travel weather. MCP + x402.
Django-based job-search analytics platform with funnel metrics, data quality checks, workbook exports, evidence documentation, and 133 passing tests.
Open Source Intelligence MCP — repo health, dependency risk, trending, license checks. FoundryNet Data Network.
Fact Verification MCP — verify claims with cross-referenced sources + MINT provenance. FoundryNet Data Network.
Production-ready Multi-Agent Data Intelligence Platform: autonomous SQL generation, statistical analysis & visualization powered by LangGraph, Qdrant RAG, and E2B sandboxed execution.
Derived financial intelligence for AI agents — insider patterns, earnings, institutional moves, ratio screening with a proprietary value score. MCP + x402.
Social Trends Intelligence MCP — trending topics, sentiment, viral content, brand mentions. FoundryNet Data Network.
Cybersecurity threat intelligence for AI agents — CVE search, EPSS exploit prediction, CISA KEV, IP reputation, threat feed. MCP + x402.
Government contract search and federal procurement data for AI agents — SAM.gov opportunities + USASpending awards, via MCP (x402).
Official Python SDK for CrawlSnap — the data intelligence API platform
Domain & brand intelligence for AI agents — company enrichment, domain intelligence, tech stack detection, brand research. MCP + x402.
Academic research intelligence for AI agents — paper search, scientific literature, citation analysis, arXiv, author metrics, trending research & semantic related-work (pgvector). MCP + x402.
Regulatory & compliance intelligence for AI agents — FDA recalls, federal register, enforcement actions, comment deadlines, by industry + severity. MCP + x402.