Search: mapreduce

  • MapReduce

    MapReduce is a software framework that allows developers to write programs that process massive amounts of unstructured...

    View Related Content
  • Amazon Elastic MapReduce (Amazon EMR)

    Amazon Elastic MapReduce (EMR) is an Amazon Web Service (AWS) for data processing and analysis.

    View Related Content
  • Apache Pig

    Apache Pig is an open-source technology that offers a high-level mechanism for parallel programming of MapReduce jobs...

    View Related Content
    • Apache Pig

      Apache Pig is an open-source technology that offers a high-level mechanism for parallel programming of MapReduce jobs to be exe...

    • Three Approaches to Data Analysis with Hadoop

      Access this white paper to get an analysis of large data sets using three different tools that are part of the Hadoop ecosystem...

    • Apache YARN helps to knit more mature Hadoop offering

      Apache YARN spun a tale of its own at Hortonworks' Hadoop Summit, but it wasn't the only big data announcement. The Data Mill r...

  • SQL-on-Hadoop

    SQL-on-Hadoop is a class of analytical application tools that combine established SQL-style querying with newer Hadoop data fra...

    View Related Content
  • Apache Hadoop YARN (Yet Another Resource Negotiator)

    Apache Hadoop YARN (short, in self-deprecating fashion, for Yet Another Resource Negotiator) is a cluster management technology...

    View Related Content
  • Apache Spark

    Apache Spark is an open-source parallel processing framework that enables users to run large-scale data analytics applications ...

    View Related Content
  • Apache Hive

    Apache Hive is an open-source data warehouse system for querying and analyzing large datasets stored in Hadoop files. Hadoop is...

    View Related Content
  • Hadoop

    Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing e...

    View Related Content
  • Hadoop 2

    Apache Hadoop 2 is the second iteration of the Hadoop framework for distributed data processing.  Hadoop 2 adds support for ru...

    View Related Content
  • big data management

    Big data management is the organization, administration and governance of large volumes of both structured and unstructured data.

    View Related Content

Search Again: