Analytics/Cluster/Hadoop
The Analytics Hadoop Cluster consists of the following systems:
- 1 master node (NameNode, ResourceManager, etc.)
- 1 standby NameNode
- 22 worker nodes (DataNode, NodeManager)
The hardware infrastructure page has the system description and configurations.
We run Cloudera's CDH5.
Management links
See Analytics/Cluster/Access for instructions on setting up a SOCKS proxy to connect to these internally-hosted web interfaces.
- http://analytics1010.eqiad.wmnet:8088/ - Hadoop YARN Job Browser
- http://analytics1010.eqiad.wmnet:50070/ - Hadoop Distributed File System NameNode Status
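The YARN interface listed above can also be queried from a script. The following is a minimal sketch, not an established tool on this cluster: it assumes the ResourceManager on port 8088 exposes the standard YARN REST endpoint `/ws/v1/cluster/apps`, and that the script runs somewhere the host is directly reachable (Python's `urllib` does not speak SOCKS, so going through the proxy would need an additional library). The helper names `yarn_apps_url`, `fetch_json` and `summarize_apps` are made up for illustration.

```python
import json
from urllib.parse import urlencode, urlunsplit
from urllib.request import Request, urlopen

# Host and port are taken from the YARN link above.
MASTER_HOST = "analytics1010.eqiad.wmnet"
YARN_PORT = 8088


def yarn_apps_url(host=MASTER_HOST, port=YARN_PORT, states=("RUNNING",)):
    """Build the URL of the YARN REST endpoint that lists applications.

    Assumes the standard ResourceManager REST path /ws/v1/cluster/apps.
    """
    # Keep commas unescaped so several states read as states=A,B.
    query = urlencode({"states": ",".join(states)}, safe=",")
    return urlunsplit(("http", f"{host}:{port}", "/ws/v1/cluster/apps", query, ""))


def fetch_json(url, timeout=10):
    """Fetch a URL and decode the JSON body (needs direct network access)."""
    request = Request(url, headers={"Accept": "application/json"})
    with urlopen(request, timeout=timeout) as response:
        return json.load(response)


def summarize_apps(payload):
    """Reduce a /ws/v1/cluster/apps response to (id, user, name, state) tuples."""
    # YARN returns {"apps": null} when no application matches the filter.
    apps = (payload.get("apps") or {}).get("app") or []
    return [(a.get("id"), a.get("user"), a.get("name"), a.get("state")) for a in apps]
```

From a host with access to the cluster, `summarize_apps(fetch_json(yarn_apps_url()))` would return one tuple per running application.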