Browse Definitions :
Definition

Presto

Contributor(s): Matthew Haughn

Presto is a free and open source distributed SQL query engine designed for the demands of big data.

Presto can run analytic queries on data ranging from gigabytes to petabytes, which enables it to search huge data warehouses. Presto offers speeds close to those of commercial solutions without excessive hardware requirements.

Presto was purpose-designed and coded to run interactive analytical searches swiftly and process results as quickly as a commercial data warehouse. Presto can scale up to the largest requirements, dealing with the 300PB size of Facebook’s massive data warehouse while also querying multiple data sources. Presto queries data where it is resident and supports Hive, Cassandra, relational databases and proprietary data stores.

Data analysts use Presto for its fast response times, from a less than a second to minutes. Facebook uses Presto themselves: Over 1000 Facebook employees use Presto daily, to run more than 30,000 queries. On average, the queries of Facebook employees scan through over a petabyte of data every day.

This was last updated in December 2017

Continue Reading About Presto

Start the conversation

Send me notifications when other members comment.

Please create a username to comment.

-ADS BY GOOGLE

File Extensions and File Formats

SearchCompliance

  • regulatory compliance

    Regulatory compliance is an organization's adherence to laws, regulations, guidelines and specifications relevant to its business...

  • privacy compliance

    Privacy compliance is a company's accordance with established personal information protection guidelines, specifications or ...

  • data governance policy

    A data governance policy is a documented set of guidelines for ensuring that an organization's data and information assets are ...

SearchSecurity

SearchHealthIT

  • telemedicine (telehealth)

    Telemedicine is the remote delivery of healthcare services, such as health assessments or consultations, over the ...

  • Project Nightingale

    Project Nightingale is a controversial partnership between Google and Ascension, the second largest health system in the United ...

  • medical practice management (MPM) software

    Medical practice management (MPM) software is a collection of computerized services used by healthcare professionals and ...

SearchDisasterRecovery

SearchStorage

  • megabytes per second (MBps)

    Megabytes per second (MBps) is a unit of measurement for data transfer speed to and from a computer storage device.

  • zettabyte

    A zettabyte is a unit of measurement used by technology professionals and the general public to describe a computer or other ...

  • hybrid flash array

    A hybrid flash array is a solid-state storage system that contains a mix of flash memory drives and hard disk drives.

Close