Browse Definitions :
Definition

Presto

Presto is a free and open source distributed SQL query engine designed for the demands of big data.

Presto can run analytic queries on data ranging from gigabytes to petabytes, which enables it to search huge data warehouses. Presto offers speeds close to those of commercial solutions without excessive hardware requirements.

Presto was purpose-designed and coded to run interactive analytical searches swiftly and process results as quickly as a commercial data warehouse. Presto can scale up to the largest requirements, dealing with the 300PB size of Facebook’s massive data warehouse while also querying multiple data sources. Presto queries data where it is resident and supports Hive, Cassandra, relational databases and proprietary data stores.

Data analysts use Presto for its fast response times, from a less than a second to minutes. Facebook uses Presto themselves: Over 1000 Facebook employees use Presto daily, to run more than 30,000 queries. On average, the queries of Facebook employees scan through over a petabyte of data every day.

This was last updated in December 2017

Continue Reading About Presto

SearchCompliance
  • risk reporting

    Risk reporting is a method of identifying risks tied to or potentially impacting an organization's business processes.

  • risk profile

    A risk profile is a quantitative analysis of the types of threats an organization, asset, project or individual faces.

  • risk appetite

    Risk appetite is the amount of risk an organization is willing to take in pursuit of objectives it deems have value.

SearchSecurity
SearchHealthIT
SearchDisasterRecovery
  • What is risk mitigation?

    Risk mitigation is a strategy to prepare for and lessen the effects of threats faced by a business.

  • fault-tolerant

    Fault-tolerant technology is a capability of a computer system, electronic system or network to deliver uninterrupted service, ...

  • synchronous replication

    Synchronous replication is the process of copying data over a storage area network, local area network or wide area network so ...

SearchStorage
  • cloud archive

    A cloud archive is storage as a service for long-term data retention.

  • cache

    A cache -- pronounced CASH -- is hardware or software that is used to store something, usually data, temporarily in a computing ...

  • archive

    An archive is a collection of data moved to a repository for long-term retention, to keep separate for compliance reasons or for ...

Close