New Apache Project ‘Drill’ Aims to Speed Up Hadoop Queries

Aug 21, 2012

Finding much faster ways to complete Hadoop queries for enterprise users is the aim of "Drill," the latest open-source project being undertaken by the Apache Software Foundation.

Drill has been established as an Apache Incubator project, opening its continued development to software engineers around the world, according to Tomer Shiran, director of product management for Hadoop vendor MapR Technologies, one of the backers of the Apache Drill project.

The Drill project will work to create an open-source version of Google’s Dremel, the interactive query system that Google uses internally to speed up analysis of very large data sets.

"We’ve spent quite a few months talking to lots of organizations and potential users of Drill and to our customer base as well," said Shiran, who is a founding member of the Drill project. "We wanted to put this out there as an open-source project, rather than just keep it within MapR for our use alone."

Drill aids Hadoop users by enabling vastly quicker queries of huge data sets, said Shiran.

"With Drill, you’ll be able to get really fast responses," he said. Users will be able to get responses within one second, which is a key difference from other tools that are available today, he added.

As designed, Hadoop processes large data sets in batches. Drill will improve on that method by doing "interactive analysis" that can find the required answers in the data more quickly, said Shiran. "Interactive analysis is much faster than batch processing."

The need for tools like Drill is driven by ever-increasing user requirements, he said. "People have been doing queries in Hadoop, but since it doesn’t return answers to you within a few seconds, it has limitations."

Using Drill, users will be able to do ad hoc analysis and get faster responses, whether they are seeking anomalies, data trends or even network intrusions, according to Shiran. "With all of those things, you’re going to have to get a pretty fast response or by the time you do figure it out, it’s going to be old news."

The nascent Drill open-source project is in development, with a variety of companies and individuals already working on it. "A broad-based effort will be working on this," said Shiran. "There’s quite a few people actively developing on the project now, so I don’t think it will be a long time before we have an early version released."

To read the original eWeek article, click here: New Apache Project ‘Drill’ Aims to Speed Up Hadoop Queries
