LoreonLabsPlatform
DocsHome
  • Overview

Intelligence

  • Markets
  • Builders
  • Research
  • Ecosystems
  • Launchpads
  • Search
Builders

@commoncrawl

Sebastian Nagel

@sebastian-nagel · Konstanz, Germany

GitHubWebsite
Followers
131
Public repos
63
Stars (recent)
17
Ecosystems
1

Projects

Repositories this builder owns.

nutch
Mirror of Apache Nutch
2
webarchive-commons
Common web archive utility code.
—
warc-crawler
Process web archives (WARC format) with StormCrawler and index content into OpenSearch
9
crawler-commons
A set of reusable Java components that implement functionality common to any web crawler
—

Connected narratives

AI AgentsConsumer CryptoSocialFiStablecoinsAgent CommerceOnchain Apps

Related builders

Others building in the same ecosystem.

Jesse Pollak
1.3K followers
0xDeployer
— followers
Xen
— followers
Ahaan Raizada
— followers
Youssef
— followers
Igor Yuzo
— followers
nutch-test-single-node-cluster
No description.
5
storm-crawler
Web crawler SDK based on Apache Storm
1
jwarc
Java library for reading and writing WARC files with a typed API
—
uap-core
The regex file necessary to build language ports of Browserscope's user agent parser.
—

Recent activity

Most recently pushed work.

  • sebastian-nagel/nutch
    pushed 7h ago
  • sebastian-nagel/webarchive-commons
    pushed 5d ago
  • sebastian-nagel/warc-crawler
    pushed 17d ago
  • sebastian-nagel/crawler-commons
    pushed 26d ago