Browse Definitions :
Definition

Presto

Contributor(s): Matthew Haughn

Presto is a free and open source distributed SQL query engine designed for the demands of big data.

Presto can run analytic queries on data ranging from gigabytes to petabytes, which enables it to search huge data warehouses. Presto offers speeds close to those of commercial solutions without excessive hardware requirements.

Presto was purpose-designed and coded to run interactive analytical searches swiftly and process results as quickly as a commercial data warehouse. Presto can scale up to the largest requirements, dealing with the 300PB size of Facebook’s massive data warehouse while also querying multiple data sources. Presto queries data where it is resident and supports Hive, Cassandra, relational databases and proprietary data stores.

Data analysts use Presto for its fast response times, from a less than a second to minutes. Facebook uses Presto themselves: Over 1000 Facebook employees use Presto daily, to run more than 30,000 queries. On average, the queries of Facebook employees scan through over a petabyte of data every day.

This was last updated in December 2017

Continue Reading About Presto

Start the conversation

Send me notifications when other members comment.

Please create a username to comment.

-ADS BY GOOGLE

File Extensions and File Formats

SearchCompliance

  • risk management

    Risk management is the process of identifying, assessing and controlling threats to an organization's capital and earnings.

  • compliance as a service (CaaS)

    Compliance as a Service (CaaS) is a cloud service service level agreement (SLA) that specified how a managed service provider (...

  • data protection impact assessment (DPIA)

    A data protection impact assessment (DPIA) is a process designed to help organizations determine how data processing systems, ...

SearchSecurity

  • spyware

    Spyware is a type of malicious software -- or malware -- that is installed on a computing device without the end user's knowledge.

  • application whitelisting

    Application whitelisting is the practice of specifying an index of approved software applications or executable files that are ...

  • botnet

    A botnet is a collection of internet-connected devices, which may include PCs, servers, mobile devices and internet of things ...

SearchHealthIT

SearchDisasterRecovery

  • business continuity plan (BCP)

    A business continuity plan (BCP) is a document that consists of the critical information an organization needs to continue ...

  • disaster recovery team

    A disaster recovery team is a group of individuals focused on planning, implementing, maintaining, auditing and testing an ...

  • cloud insurance

    Cloud insurance is any type of financial or data protection obtained by a cloud service provider. 

SearchStorage

  • DRAM (dynamic random access memory)

    Dynamic random access memory (DRAM) is a type of semiconductor memory that is typically used for the data or program code needed ...

  • RAID 10 (RAID 1+0)

    RAID 10, also known as RAID 1+0, is a RAID configuration that combines disk mirroring and disk striping to protect data.

  • PCIe SSD (PCIe solid-state drive)

    A PCIe SSD (PCIe solid-state drive) is a high-speed expansion card that attaches a computer to its peripherals.

Close