GitHub - needmorecowbell/Hamburglar: Hamburglar -- collect useful information from urls, directories, and files

The Hamburglar

Setup

There are 2 versions of hamburglar, full and lite. The main branch is the full version, and hamburglar lite is on a separate branch.

Hamburglar

Full fledged scraping tool for artifact retrieval from multiple sources. There are some dependencies, so install them first:

pip3 install -r requirements.txt

Hamburglar also has the option of checking against file signatures during a hexdump. It will get skipped if not set up. To get it working, you will need to first create the database and a user:

CREATE DATABASE 
CREATE USER 'hamman'@'localhost' IDENTIFIED BY 'deadbeef';
GRANT ALL PRIVILEGES ON fileSign.signatures TO 'hamman'@'localhost';

Then, run magic_sig_scraper. This can be run on a cronjob to regularly update it, or just run it once:

python3 magic_sig_scraper.py

Hamburglar Lite

Multithreaded and recursive directory scraping script. Stores useful information with the filepath and finding. Hamburglar lite will never require external packages, and will always remain as a single script. Setup is as simple as requesting the file and using it:

wget https://raw.githubusercontent.com/needmorecowbell/Hamburglar/hamburglar-lite/hamburglar-lite.py

This is designed to be quickly downloaded and executed on a machine.

Operation

usage: hamburglar.py [-h] [-g] [-x] [-v] [-w] [-i] [-o FILE] [-y YARA] path

positional arguments:
  path                  path to directory, url, or file, depending on flag
                        used

optional arguments:
  -h, --help            show this help message and exit
  -g, --git             sets hamburglar into git mode
  -x, --hexdump         give hexdump of file
  -v, --verbose         increase output verbosity
  -w, --web             sets Hamburgler to web request mode, enter url as path
  -i, --ioc             uses iocextract to parse contents
  -o FILE, --out FILE   write results to FILE
  -y YARA, --yara YARA  use yara ruleset for checking

Directory Traversal

python3 hamburglar.py ~/Directory/
- This will recursively scan for files in the given directory, then analyzes each file for a variety of findings using regex filters

Single File Analysis

python3 hamburglar.py ~/Directory/file.txt
- This will recursively scan for files in the given directory, then analyzes each file for a variety of findings using regex filters

YARA Rule Based Analysis

python3 hamburglar.py -y rules/ ~/Directory
- This will compile the yara rule files in the rules directory and then check them against every item in Directory.

Git Scraping Mode

python3 hamburglar.py -g https://www.github.com/needmorecowbell/Hamburglar
- Adding -y <rulepath> will allow the repo to be scraped using yara rules

Web Request Mode

python3 hamburglar.py -w https://google.com
- Adding a -w to hamburgler.py tells the script to handle the path as a url.
- Currently this does not spider the page, it just analyzes the requested html content

IOC Extraction

python3 hamburglar.py -w -i https://pastebin.com/SYisR95m
- Adding a -i will use iocextract to extract any ioc's from the requested url

Hex Dump Mode

python3 hamburglar.py -x ~/file-to-dump
- This just does a hex dump and nothing more right now -- could be piped into a file
- This will eventually be used for binary analysis

Tips

Adding -v will set the script into verbose mode, and -h will show details of available arguments
Adding -o FILENAME will set the results filename, this is especially useful in scripting situations where you might want multiple results tables (ie github repo spidering)

Settings

whitelistOn: turns on or off whitelist checking
maxWorkers: number of worker threads to run concurrently when reading file stack
whitelist: list of files or directories to exclusively scan for (if whitelistOn=True)
blacklist: list of files, extensions, or directories to block in scan
regexList: dictionary of regex filters with filter type as the key

The Hamburglar can find

ipv4 addresses (public and local)
emails
private keys
urls
ioc's (using iocextract)
cryptocurrency addresses
anything you can imagine using regex filters and yara rules

Example output:

{
    "/home/adam/Dev/test/email.txt": {
        "emails": "{'[email protected]'}"
    },
    "/home/adam/Dev/test/email2.txt": {
        "emails": "{'[email protected]'}"
    },
    "/home/adam/Dev/test/ips.txt": {
        "ipv4": "{'10.0.11.2', '192.168.1.1'}"
    },
    "/home/adam/Dev/test/test2/email.txt": {
        "emails": "{'[email protected]', '[email protected]'}"
    },
    "/home/adam/Dev/test/test2/ips.txt": {
        "ipv4": "{'10.0.11.2', '192.168.1.1'}"
    },
    "/home/adam/Dev/test/test2/links.txt": {
        "site": "{'http://login.web.com'}"
    }
}

Contributions

Please contribute! If there's an error let me know -- even better if you can fix it :)
A big thank you to anyone who has helped:
- adi928
- jaeger-2601
- tijko
- joanbono and Xumeiquer for the rules from yara-forensics

Name		Name	Last commit message	Last commit date
Latest commit History 72 Commits
rules		rules
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
ham.conf		ham.conf
hamburglar.py		hamburglar.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The Hamburglar

Setup

Operation

Settings

The Hamburglar can find

Example output:

Contributions

About

Releases

Packages

Contributors 8

Languages

License

needmorecowbell/Hamburglar

Folders and files

Latest commit

History

Repository files navigation

The Hamburglar

Setup

Operation

Settings

The Hamburglar can find

Example output:

Contributions

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 8

Languages

Packages