πHow to Get Unique Subdomains on Large scope
source: https://medium.com/@h4x0r_dz/the-right-way-to-get-unique-subdomains-on-large-scope-899f834e702c
how to Get The Uniq Subdomains on a big Subdomains list. to print unique ones only
this writeup will be a short one forry about that xD, during My hunting on big companies such as Facebook, google, amazon β¦etc, and In the subdomain enumeration step, I found that I get so many same subdomains.
for example :
these subdomains have the same web application, and it is a long journey to visit and check all these subdomains one by one, (and it worth nothing ).
so instead we can use a trick to filter similar subdomains. and print only unique one , this can be done using httpx and awk built-in tool
Iβm going to use twitter.com as an example :
subdomain enumeration
the first step as expected is to extract the subdomains, we are going to enumerate all the subdomains and save them in a Text file.
you can use your favorite for subdomain enumeration, and save the result in text file :
httpx to enumerate live subdomains
now we are going to use httpx tool with these options :
now we are going to use httpx tool with these options :
-l subdomains.txt: This flag specifies the input source for subdomains. -sc: This flag likely stands for βstatus code.β -title: This flag instructs the tool to extract and display the titles -cl: This flag stands for βcontent length.β -wc: This flag stands for βword count.β -td: display technology in use based on wappalyzer dataset | tee httpx2.txt: This part of the command is using the tee command, which is used to capture the standard output (stdout) of the preceding command and write it to a file. In this case, the output of the httpx command is being captured and written to a file named httpx2.txt.
as you can see, there are so many subdomains has the same HTTP status code & Title & content length.
awk to filter the duplicate subdomains
now after getting the (maybe the same web application on different subdomains ) on our text list , we need to filter theme and print only the unique ones and doing our pen-testing on them . to do this we need to use a tool called awk
to do this we need to use a tool called awk
we need to filter the list of subdomains based on status code & content length β¦.
full command
Fβ[β: This flag sets the field separator for awk as an opening square bracket [. This means that awk will split input lines into fields based on the specified separator.
-Fβ[β: This flag sets the field separator for awk as an opening square bracket [. This means that awk will split input lines into fields based on the specified separator.
β!seen[$2, $3, $4, $5]++β: This is the awk script that defines the processing to be applied to each input line.
this method is not working properly, but it still works fine on the large scope for me xD.
Last updated