Accessible Hadoop Report

This report identifies hosts that are running Hadoop and have either the NameNode or DataNode web interfaces running and accessible to the world on the Internet.

At a minimum, this can allow for information-gathering against the target organization. In other instances, it may allow a miscreant to manipulate the Hadoop instance.

For more details behind the scan methodology and a daily update of global Hadoop scan statistics please visit our dedicated Open Hadoop scan page.

For more information on our scanning efforts, check out our Internet scanning summary page.


  • timestamp
    Time that the IP was probed in UTC+0
  • ip
    The IP address of the device in question
  • protocol
    Protocol that the response came on (always TCP)
  • port
    Port that the response came from (50070/TCP or 50075/TCP)
  • hostname
    Reverse DNS name of the device in question
  • version
    Running version of Hadoop
  • tag
    Will always be hadoop
  • asn
    ASN of where the device in question resides
  • geo
    Country where the device in question resides
  • region
    State / Province / Administrative region where the device in question resides
  • city
    City in which the device in question resides
  • naics
    North American Industry Classification System Code
  • sic
    Standard Industrial Classification System Code
  • server_type
    The type of service that responded; this value is either "namenode" (response to probe on port 50070/TCP) or "datanode" (response to probe on port 50075/TCP); fields that contain data from only one type of response are denoted with either "namenode" or "datanode"
  • clusterid
    Unique ID of the cluster
  • total_disk
    The total amount of disk space available to Hadoop (in bytes) (namenode responses only)
  • used_disk
    The amount of disk space used by Hadoop (in bytes) (namenode responses only)
  • free_disk
    The amount of disk space free to Hadoop (in bytes) (namenode responses only)
  • livenodes
    The first live datanode name listed in the response (namenode responses only)
  • namenodeaddress
    Trivial hostname of the NameNode that the DataNode is associated with (datanode responses only)
  • volumeinfo
    The path that the hadoop data is stored in (datanode responses only)


"2017-09-13 02:06:05","",50070,,"2.7.3, rbaa91f7c6bc9cb92be5982de4719c1c8af91ccff",15296,"CA","ALBERTA","CALGARY",0,0,"namenode","CID-64471a53-60cb-4302-9832-92f321f111fe",41567956992,53248,25160089600,"edmonton:50010",,
"2017-09-13 02:06:05","",50070,,"2.5.0-cdh5.3.5, rc15da49be59e6ee467549c5ad9e99e1d4d31f972 ",14061,"US","NEW JERSEY","CLIFTON",0,0,"namenode","CID-5aa27504-96d6-4bde-9da4-c5498ac57d5c",169068572672,24576,153751240704,"s201709-8.localdomain",,
"2017-09-13 02:07:48","",50075,,"",8075,"US","IOWA","DES MOINES",334111,357101,"datanode","CID-771bae52-9e4f-4ec4-bc1a-c867585751f0",,,,,"","/hadoop/hdfs/data/current"

Our 137 Report Types