Katana

project discovery crawler

#2024 One liner

katana -u http.txt -d 20 -ps -pss waybackarchive,commoncrawl,alienvault -kf -jc -fx -ef woff,css,png,svg,jpg,woff2,jpeg,gif,svg -o allurls.txt
sudo git clone https://github.com/projectdiscovery/katana.git

OR

go install github.com/projectdiscovery/katana/cmd/katana@latest


katana -list http_hosts.txt (list)

katana -u https://target.com (single target)

katana -u https://target.com -silent -em js,jsp,json (extensions)

katana -u https://target.com -kf robotstxt sitemapxml (files)

katana -u https://target.com -headless (nuff said)

katana -u https://target.com -jsonl | jq  (json parsing)

katana -u https://target.com -ct 10 (crawls for 10 secs then stops)

katana -u https://target.com -jc (javascript file parsing)

katana -u https://target.com -f qurl (url query params)

katana -u https://target.com -f qpath (output only path in query)

katana -u https://target.com -f file (output only filetype)

katana -u https://target.com -f key (parameter keys in url)

katana -u https://target.com -f dir (directories)


#Crawling
subfinder -d target.com -silent -all | httpx -silent | katana -d 5 -silent | grep -iE '\.js'| grep -iEv '(\.jsp|\.json)'
subfinder -d target.com -silent -all | httpx -silent | katana -d 5 -silent -em js,jsp,json

Last updated