This application crawls websites for URLs and stores the discovered domains in a nested key-value database that supports reverse subdomain searches: each domain is split into its labels and stored top-level-domain first, so every known subdomain of a domain sits under a single subtree. The database consists of two root buckets, UNCRAWLED and ALL. See below for an example.
```
UNCRAWLED
    google.com
    facebook.com
    docs.google.com

ALL
    com
        google
            sheets
            docs
        facebook
    co
        uk
            amazon
```
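The README does not name the database, but the nested-bucket layout above matches embedded stores such as BoltDB/bbolt. Below is a minimal sketch of how a crawled domain could be inserted under ALL, assuming bbolt; the database file name and the `insertDomain` helper are hypothetical, not taken from the actual source:

```go
package main

import (
	"log"
	"strings"

	bolt "go.etcd.io/bbolt"
)

// insertDomain stores a domain such as "docs.google.com" under the ALL
// bucket as nested buckets com -> google -> docs, matching the layout above.
// (Hypothetical helper; the real crawler's code is not shown in this README.)
func insertDomain(db *bolt.DB, domain string) error {
	labels := strings.Split(domain, ".")
	return db.Update(func(tx *bolt.Tx) error {
		b, err := tx.CreateBucketIfNotExists([]byte("ALL"))
		if err != nil {
			return err
		}
		// Walk the labels in reverse (TLD first), creating a bucket per label.
		for i := len(labels) - 1; i >= 0; i-- {
			sub, err := b.CreateBucketIfNotExists([]byte(labels[i]))
			if err != nil {
				return err
			}
			b = sub
		}
		return nil
	})
}

func main() {
	// "crawler.db" is an assumed file name for illustration only.
	db, err := bolt.Open("crawler.db", 0600, nil)
	if err != nil {
		log.Fatal(err)
	}
	defer db.Close()
	if err := insertDomain(db, "docs.google.com"); err != nil {
		log.Fatal(err)
	}
}
```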
Download the binary here
Start crawling using the default options:

```
./crawler
```

Use the -h flag to show all arguments:

```
./crawler -h
```

Example:

```
./crawler -seed bbc.com -threads 200
```
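The crawler's source is not shown here, but in Go such flags are typically wired with the standard flag package. Here is a minimal sketch, assuming -seed names the start domain and -threads the number of concurrent workers (both flag names are taken from the example above; the defaults are assumptions):

```go
package main

import (
	"flag"
	"fmt"
)

func main() {
	// Flag names mirror the CLI example above; defaults are assumed.
	seed := flag.String("seed", "bbc.com", "domain the crawl starts from")
	threads := flag.Int("threads", 10, "number of concurrent crawl workers")
	flag.Parse()

	fmt.Printf("starting crawl at %s with %d workers\n", *seed, *threads)
	// ... start the worker pool here ...
}
```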
While the crawler is stopped, you can get stats or make a query:

```
./crawler -search google.com
```

or

```
./crawler -stat
```
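Continuing the bbolt assumption from the sketch above, a -search style lookup could reverse the query's labels, descend the nested buckets, and list the child buckets as known subdomains. The `subdomainsOf` helper below is hypothetical:

```go
package main

import (
	"fmt"
	"log"
	"strings"

	bolt "go.etcd.io/bbolt"
)

// subdomainsOf lists the direct children stored under a domain in the ALL
// bucket, e.g. "google.com" -> [docs sheets] for the layout above.
// (Hypothetical helper, assuming a bbolt-style nested-bucket layout.)
func subdomainsOf(db *bolt.DB, domain string) ([]string, error) {
	labels := strings.Split(domain, ".")
	var out []string
	err := db.View(func(tx *bolt.Tx) error {
		b := tx.Bucket([]byte("ALL"))
		if b == nil {
			return nil
		}
		// Descend com -> google by walking the labels in reverse.
		for i := len(labels) - 1; i >= 0; i-- {
			if b = b.Bucket([]byte(labels[i])); b == nil {
				return nil
			}
		}
		// Every nested bucket at this level is a known subdomain label.
		return b.ForEach(func(k, v []byte) error {
			if v == nil { // nested buckets have nil values in bbolt
				out = append(out, string(k))
			}
			return nil
		})
	})
	return out, err
}

func main() {
	db, err := bolt.Open("crawler.db", 0600, nil) // assumed file name
	if err != nil {
		log.Fatal(err)
	}
	defer db.Close()
	subs, err := subdomainsOf(db, "google.com")
	if err != nil {
		log.Fatal(err)
	}
	fmt.Println(subs)
}
```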
Requirements
- Golang 1.15

Build for Linux with:

```
make build-linux
```
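The Makefile is not included in this section; under a standard Go cross-compilation setup, the build-linux target presumably amounts to something like the following (output name and architecture are assumptions):

```
GOOS=linux GOARCH=amd64 go build -o crawler .
```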