Common Crawl 2026年4月爬取数据和URL索引已上传至Hugging Face,用户可通过SQL直接查询超过21.9亿网页,无需下载,大幅降低数据处理门槛。
RT @vanstriendaniel: You can now run SQL over 2.19 BILLION web pages. Zero download!
@CommonCrawl April 2026 crawl + URL index are on @hug…
likes: 33 | retweets: 8 | replies: 4 | views: 10225