Definition

Presto

Contributor(s): Matthew Haughn

Presto is a free and open source distributed SQL query engine designed for the demands of big data.

Presto can run analytic queries on data sets ranging from gigabytes to petabytes in size, which lets it query very large data warehouses. It offers speeds close to those of commercial solutions without excessive hardware requirements.

Presto was purpose-built to run interactive analytic queries and return results as quickly as a commercial data warehouse. It scales to very large deployments: it handles Facebook's 300PB data warehouse and can also query multiple data sources. Presto queries data where it resides, and it supports Hive, Cassandra, relational databases and proprietary data stores.
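As a rough illustration of querying data where it resides, the sketch below builds a single SQL statement that joins a Hive-backed table with a Cassandra-backed one. Presto addresses tables as catalog.schema.table, with each catalog mapped to a data source. The catalog, schema, table and column names here are hypothetical and do not come from any real deployment; the helper functions are assumptions for illustration, not part of Presto. The script only composes the query text and does not connect to a server.

```python
# Minimal sketch: composing a query that spans two Presto catalogs.
# All catalog, schema, table, and column names below are hypothetical.


def qualified(catalog: str, schema: str, table: str) -> str:
    """Return a Presto-style fully qualified table name: catalog.schema.table."""
    return f"{catalog}.{schema}.{table}"


def build_federated_query() -> str:
    """Compose one SQL statement that joins tables from two different catalogs."""
    clicks = qualified("hive", "web", "clicks")      # assumed Hive-backed table
    users = qualified("cassandra", "app", "users")   # assumed Cassandra-backed table
    return (
        "SELECT u.country, count(*) AS click_count\n"
        f"FROM {clicks} AS c\n"
        f"JOIN {users} AS u ON c.user_id = u.user_id\n"
        "GROUP BY u.country\n"
        "ORDER BY click_count DESC"
    )


if __name__ == "__main__":
    # Print the statement; submitting it would require a running Presto cluster.
    print(build_federated_query())
```

The resulting text could be submitted through any Presto client. Because each table is named by its catalog, the data does not need to be copied into a single store before it is joined.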

Data analysts use Presto for its fast response times, which range from less than a second to minutes. Facebook uses Presto itself: more than 1,000 Facebook employees use it daily to run more than 30,000 queries. On average, those queries scan more than a petabyte of data in total every day.

This was last updated in December 2017


