• We Are
    Data Scientists
    Hadoop Experts
    Problem Solvers

Our Services

Insight Development

We transform raw data into actionable insights using analytics & Data Science

Data Exploration

We help companies understand patterns, trends and relationships in their data that can produce real ROI

Hadoop Strategy

We help companies develop a strategy for implementing Hadoop & building analytics with Data Science

  

Legacy Migration

We migrate existing application environments from traditional data warehousing/BI platforms to the Hadoop Ecosystem

Hadoop Jumpstart

We train your company's staff on the Hadoop Ecosystem so they can work in it efficiently

Hadoop Implementation

We help companies implement production Hadoop environments complete with security, high availability and multi-tenancy

Latest From Our Blog

Written by Spry

In many of our use cases, the data we work with does not come ready to be fed into an analytics workflow. It must first be ingested and prepared. This includes renaming and/or reordering fields, changing data types, filtering out invalid values, and combining different parts of the same data source. In this post, we cover how to perform these steps using a data pipeline tool called Alteryx, walking through a workflow we used for one of our clients.
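The preparation steps listed above can be sketched in a few lines of plain Python. This is only an illustration of the ideas, not the Alteryx workflow from the post: the field names, sample records, and the rule that a negative amount is invalid are all hypothetical assumptions made for the example.

```python
# Hypothetical raw records, split across two parts of the same source.
part_a = [
    {"ID": "1", "Amt": "19.99", "cust_nm": "Acme"},
    {"ID": "2", "Amt": "n/a", "cust_nm": "Globex"},
]
part_b = [
    {"ID": "3", "Amt": "5.00", "cust_nm": "Initech"},
    {"ID": "4", "Amt": "-1", "cust_nm": "Umbrella"},
]

# Assumed mapping from raw field names to cleaned names, and the target order.
RENAMES = {"ID": "order_id", "cust_nm": "customer", "Amt": "amount"}
FIELD_ORDER = ["order_id", "customer", "amount"]


def prepare(record):
    """Rename, retype, and reorder one record; return None if it is invalid."""
    renamed = {RENAMES[key]: value for key, value in record.items()}
    try:
        # Change data types: everything arrives as text.
        renamed["order_id"] = int(renamed["order_id"])
        renamed["amount"] = float(renamed["amount"])
    except ValueError:
        return None  # unparseable value, e.g. "n/a"
    if renamed["amount"] < 0:
        return None  # assumed business rule: negative amounts are invalid
    # Reorder fields into the target layout.
    return {field: renamed[field] for field in FIELD_ORDER}


def run_pipeline(*parts):
    """Combine the parts, prepare each record, and drop the invalid ones."""
    combined = [record for part in parts for record in part]
    prepared = (prepare(record) for record in combined)
    return [record for record in prepared if record is not None]


clean = run_pipeline(part_a, part_b)
```

Here `clean` keeps only the two records whose values parse and pass the validity rule, with fields renamed and in the target order.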

Read more

Written by Spry

In Hive, a SELECT or COUNT query is executed as a MapReduce job even when it runs against a small table or dataset. Imagine you want to run one of these queries, one that should take only a few seconds, where the setup and teardown of the Hadoop job probably take longer than the actual work. Now imagine that another user already has a complex, long-running job executing on the cluster. That is bad news for your job. If you are using mostly default Hadoop settings for YARN scheduling, your simple job may not be executed until the other one has finished.
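To make the waiting problem concrete, here is a toy model of strict first-in, first-out execution. It is a deliberately simplified assumption, not YARN's actual scheduler, and the job names and durations are hypothetical.

```python
def fifo_schedule(jobs):
    """Given (name, duration_seconds) pairs in submission order, return
    {name: (start, finish)} when jobs run strictly one at a time."""
    clock = 0
    schedule = {}
    for name, duration in jobs:
        schedule[name] = (clock, clock + duration)
        clock += duration
    return schedule


# A long-running job is submitted just before a query needing a few seconds.
jobs = [("long_etl", 3600), ("small_count", 5)]
schedule = fifo_schedule(jobs)

# Seconds the small query spends waiting before it can start.
wait_seconds = schedule["small_count"][0]
```

In this model the five-second query waits a full hour behind the long job, which is the situation the paragraph above describes.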

Read more

Written by Spry

When building a dashboard in Tableau, the analyst may want to filter many worksheets with a single filter selection.

Read more

About Spry

We are a high-standards Data Science & Hadoop firm solving complex problems for Fortune 500 companies

Keep In Touch

  •   info (@) spryinc.com
  •   +443.212.5072
  •   53 Loveton Circle
    Suite 114
    Sparks, MD 21152