Website footprinting
- Hackers can map the entire website of the target without being noticed
- Gives information about:
  - Software
  - Operating system
  - Subdirectories
  - Contact information
  - Scripting platform
  - Query details
 
 
Web spiders
- Programs designed to help in website footprinting
 - Methodically browse a website in search of specific information.
- Information collected this way can help attackers perform social engineering attacks (a minimal crawler sketch follows below).
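A minimal sketch of what a web spider does, using only the Python standard library; the target URL is a placeholder, and a real spider would add politeness delays, `robots.txt` handling and better error handling:

```python
# Minimal web-spider sketch: fetch a page, collect same-site links, repeat.
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse
from urllib.request import urlopen

class LinkCollector(HTMLParser):
    """Collects the href attribute of every <a> tag on a page."""
    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)

def crawl(start_url, max_pages=10):
    site = urlparse(start_url).netloc
    queue, seen = [start_url], set()
    while queue and len(seen) < max_pages:
        url = queue.pop(0)
        if url in seen:
            continue
        seen.add(url)
        try:
            html = urlopen(url, timeout=5).read().decode("utf-8", "replace")
        except OSError:
            continue  # unreachable page; move on
        parser = LinkCollector()
        parser.feed(html)
        for link in parser.links:
            absolute = urljoin(url, link)
            if urlparse(absolute).netloc == site:  # stay on the target site
                queue.append(absolute)
        print(url)
    return seen

if __name__ == "__main__":
    crawl("https://testwebpage.com")  # placeholder target
```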
 
Cookie examination
- Reveals what software is running on the server and how it behaves
- Makes it possible to identify the scripting platform
 
- By examining the website headers, it is possible to obtain information about:
  - `Content-Type`
  - `Accept-Ranges`
  - Connection status
  - `Last-Modified` information
  - `X-Powered-By` information
    - E.g. `ZendServer 8.5.0`, `ASP.NET`
  - Web server information
    - The `Server` header can give you e.g. Apache server on CentOS
 
- You can also analyze what the website pulls in
  - In the developer tools of most browsers (Ctrl+Shift+C), open the network section
  - For each request you can see the remote IP address and the response headers for further analysis (see the header-inspection sketch below)
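A rough header/cookie inspection sketch using Python's `urllib`; the URL is a placeholder and the header list is just the set mentioned above, not an exhaustive one:

```python
# Header/cookie examination sketch: request a page and print the response
# headers discussed above.
from urllib.request import urlopen

INTERESTING = ["Server", "X-Powered-By", "Content-Type", "Accept-Ranges",
               "Last-Modified", "Connection", "Set-Cookie"]

def inspect_headers(url):
    response = urlopen(url, timeout=5)
    for name in INTERESTING:
        # get_all returns every occurrence (there may be several Set-Cookie headers)
        for value in response.headers.get_all(name) or []:
            print(f"{name}: {value}")

if __name__ == "__main__":
    inspect_headers("https://testwebpage.com")  # placeholder target
```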
 
 
Source code examination
- Possible to extract information from the comments
- In most browsers you can right-click and show the source
- Walkthrough
  - In almost any browser: right click => Show source
  - Check for HTML `<!-- comment -->` or JavaScript `// comment` comments
    - They are skipped by interpreters and compilers; they exist only for human eyes
    - They can be instructions for other developers, or notes developers leave for themselves
      - E.g. "this library won't work as this element is not supported"
    - They give you clues about what technology (frameworks, languages) is used in the background
 
 
 
 
- HTML links, e.g. `href=cloudarchitecture.io`
  - Gain insight into the file system structure
  - You can find e.g. a caching server and check vulnerabilities for that caching server (see the extraction sketch below)
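One possible way to pull comments and link targets out of a page's source with Python's built-in `html.parser`; the URL is a placeholder:

```python
# Source-code examination sketch: fetch a page and dump its HTML comments and
# link/script targets, which can hint at frameworks and file-system structure.
from html.parser import HTMLParser
from urllib.request import urlopen

class SourceInspector(HTMLParser):
    def handle_comment(self, data):
        print("COMMENT:", data.strip())

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name in ("href", "src") and value:
                print(f"LINK ({tag}):", value)

if __name__ == "__main__":
    html = urlopen("https://testwebpage.com", timeout=5).read()
    SourceInspector().feed(html.decode("utf-8", "replace"))
```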
 
Cloning websites
- Also called website mirroring
- Helps in
  - browsing the site offline
  - searching the website for vulnerabilities
  - discovering valuable information and metadata
 
- Sites can protect themselves against cloning with detections based on e.g. page pull speed, behavior, known scrapers, or AI.
- 💡 Good technique for setting up fake websites
  - E.g. manually recreate login pages
  - If you control the DNS, you can redirect users to the fake site.
 
- You can also save social media pages this way; however, most are protected, and cloning them is illegal.
 - Website monitoring tools can send notifications on detected changes.
- 💡 Protection against fake websites
  - Always check the domain name for misspellings
  - Make sure it's HTTPS; if it's not, the data can be sniffed easily
    - HTTPS also protects against someone taking over the DNS
    - If the other party does not have the certificate, the browser does not accept the communication
  - Check the SSL certificate authority; if it changes, that can raise questions (a certificate-check sketch follows below)
    - Certificates usually expire within a year
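A certificate-check sketch using Python's `ssl` module; the hostname is a placeholder:

```python
# Certificate-check sketch: connect over TLS and print who issued the
# certificate and when it expires.
import socket
import ssl

def inspect_certificate(hostname, port=443):
    context = ssl.create_default_context()
    with socket.create_connection((hostname, port), timeout=5) as sock:
        with context.wrap_socket(sock, server_hostname=hostname) as tls:
            cert = tls.getpeercert()
    # 'issuer' is a tuple of relative distinguished names; flatten it to a dict
    issuer = dict(item for pair in cert["issuer"] for item in pair)
    print("Issued by  :", issuer.get("organizationName", "?"))
    print("Valid until:", cert["notAfter"])

if __name__ == "__main__":
    inspect_certificate("testwebpage.com")  # placeholder target
```

Recording the issuer and expiry over time is what makes a sudden change of certificate authority stand out.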
 
 
 
- `httrack`
  - E.g. `httrack https://testwebpage.com` to copy the site
- 📝 `wget`
  - Basic utility that can be used for mirroring a website

- Or one could manually copy-paste the HTML + CSS source code (a minimal copy sketch follows below)
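A minimal single-page copy sketch in Python, assuming you only need the raw HTML of one page; real mirroring tools like `httrack` and `wget` recurse through links and also fetch images, CSS and scripts. The URL and output file are placeholders:

```python
# Save one page's HTML to disk for offline inspection.
from urllib.request import urlopen

def save_page(url, outfile="index.html"):
    html = urlopen(url, timeout=5).read()
    with open(outfile, "wb") as f:
        f.write(html)
    print(f"Saved {len(html)} bytes from {url} to {outfile}")

if __name__ == "__main__":
    save_page("https://testwebpage.com")  # placeholder target
```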
 
- You can extract the metadata of files (e.g. images) found on a webpage
  - Metadata can include
    - Owner of the file
    - GPS coordinates (images)
    - File type metadata
  - 🤗 Linux does not rely on extensions such as `.pdf` but checks the file's metadata instead
    - Helpful as you will not be fooled by the extension (see the magic-bytes sketch below)
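A sketch of identifying a file's type from its leading bytes (magic numbers) rather than its extension, similar in spirit to the Unix `file` utility; the signature table is only a small illustrative sample:

```python
# File-type-by-content sketch: inspect a file's first bytes instead of
# trusting its extension.
SIGNATURES = {
    b"%PDF":         "PDF document",
    b"\x89PNG":      "PNG image",
    b"\xff\xd8\xff": "JPEG image",
    b"PK\x03\x04":   "ZIP container (also docx/xlsx/pptx)",
}

def identify(path):
    with open(path, "rb") as f:
        head = f.read(8)
    for magic, description in SIGNATURES.items():
        if head.startswith(magic):
            return description
    return "unknown"

if __name__ == "__main__":
    print(identify("TEST_DOCUMENT.docx"))  # a docx is really a ZIP container
```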
 
 
 
- `hexdump`
  - Dumps a file as hex + ASCII for manual inspection
  - E.g. `hexdump -C TEST_DOCUMENT.docx`
  - ❗ Not recommended as it's pretty hard to extract information from raw binary
 
- `ExifTool`
  - Reads + writes metadata of audio, video, PDF, documents etc.
  - E.g. `exiftool TEST_DOCUMENT.docx` would return something like `Microsoft Office Word`, `Version: 16.0` (a metadata-dump sketch follows below)
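A sketch of where such values come from for Office files: `.docx`/`.xlsx`/`.pptx` are ZIP containers whose `docProps/core.xml` and `docProps/app.xml` carry author/application metadata of the kind ExifTool reports. The filename is a placeholder:

```python
# Office-metadata sketch: read docProps/core.xml and docProps/app.xml from a
# docx/xlsx/pptx archive and print their fields (creator, Application, etc.).
import xml.etree.ElementTree as ET
import zipfile

def dump_office_metadata(path):
    with zipfile.ZipFile(path) as archive:
        for member in ("docProps/core.xml", "docProps/app.xml"):
            try:
                xml_data = archive.read(member)
            except KeyError:
                continue  # part not present in this document
            for element in ET.fromstring(xml_data):
                tag = element.tag.split("}")[-1]  # strip the XML namespace
                if element.text:
                    print(f"{tag}: {element.text}")

if __name__ == "__main__":
    dump_office_metadata("TEST_DOCUMENT.docx")  # e.g. creator, Application, AppVersion
```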
- 📝 `Metagoofil` | Google hacking tool
  - Uses Google to find files belonging to a website that may contain metadata, and dumps their metadata.