HDFS

Redirect to:

  • Apache Hadoop#HDFS
  • To a section: This is a redirect from a topic that does not have its own page to a section of a page on the subject. For redirects to embedded anchors on a page, use {{R to anchor}} instead.

Hadoop Distributed File System is a distributed file system that handles large data sets running on commodity hardware (Ishengoma, 2013). It is used to scale a single Apache Hadoop cluster to hundreds (and even thousands) of nodes. HDFS is one of the major components of Apache Hadoop, the others being MapReduce and YARN.