The Big Data Management Platform

Got large volumes of stored big data? Then you’ve got big opportunities. Got Personally Identifiable Information (PII)? Then you’ve got big challenges.

Your stored data has a wealth of information, a vast, largely untapped resource. It should be shared, analyzed and used to improve the way you do business.

But to do that you need a system that protects the privacy of all, petabytes against security breaches, while still making it easy to use. You’ve got to ensure that the right people see the right data at the right time. And the wrong ones don’t.

With PHEMI’s sophisticated Data Central Platform, privacy concerns are alleviated and you can concentrate on the breakthrough insights, innovative processes, and genuine AHA! moments that arise from sharing all that deep, rich information.

The Ideal Architecture for Sensitive Big Data
– Privacy By Design

Play Video

See All In A Single Pane Of Glass


Protect and leverage sensitive big data

Personally Identifiable Information (PII) is a minefield for Big Data. Whether it’s HIPAA, GDPR, or CCPA, the presence of PII in a data set can render it virtually unusable. PHEMI lets you protect data with a granularity down to a single cell in a grid, all without impacting performance. Make your business realize the value of its data, regardless of sensitivity.

Simplify and get to insights faster

Is your business integrating tools, or delivering outcomes? Most big data platforms are a Rube Goldberg assembly of tools that only work well for a handful of insiders. PHEMI’s mission is to make big data simple. It’s a turnkey platform, managing the full data lifecycle from acquisition and cataloging, through to transformation into analytics-ready data sets. PHEMI’s high-performance data pipelines and automatic data governance capabilities mean that analysts spend their time on discovery, not data preparation.

Lower the cost of big data

Too many licenses and maintenance contracts? Tired of spending money on outside consultants? Not sure how to leverage the cloud? PHEMI can help. One vendor can deliver it all, giving you white-glove service and accountability. Our hybrid cloud architecture insulates you from the differences between on-premise and in-cloud big data. It lets you find your optimum balance of resources: splitting storage, compute and governance into independent units that you can deploy where they are needed.

Manage and empower users

Is working with your big data an exercise in bureaucracy? The PHEMI platform empowers data scientists and data engineers to build their own pipelines, but with guardrails, so they can’t access data they shouldn’t see, or run jobs that adversely affect everyone. Our single pane of glass console and graphical programming paradigm makes building data pipelines easy. The platform handles all of the bookkeeping for them, auditing every operation and providing full provenance for every data set. It even handles file versioning–the holy grail of big data management!


Privacy designed down to the data cell level

PHEMI is the only data lake that incorporates privacy by design, an approach that makes privacy a first-class consideration, not an afterthought. PHEMI labels every element of data with metadata, which can be used for access control. Queries are governed by built-in Attribute-Based Access Control (ABAC), the same approach used by the military. ABAC combines attributes of a subject (such as a user), and the metadata bound to the data element, evaluating this against a policy rule set. ABAC is the only way to scale the access needs of a diverse community across vast data sets. Finally, strong encryption of all data ensures that nobody can end-run the system and gain access to sensitive data sets.

Advanced de-identification of PII

Extract value from sensitive data sets by de-identifying Personally Identifiable Information (PII). PHEMI ships with a rich set of de-identification functions, allowing you to mask, round, tokenize, or encrypt potentially sensitive fields. We also include evaluation functions, such as k-anonymity, that can evaluate a data set and return a score describing the risk of re-identification. And if you have a custom algorithm you want to use, it’s simple to integrate it into a data pipeline.

The richest set of prebuilt connectors

PHEMI comes with over 200 pre-built connectors to data sources. Use them to ingest data into the PHEMI platform, or export transformed data sets out into destinations. Got an unusual application? Use our graphical framework to rapidly build a custom connector to integrate data from all the participants in your network.

All data, even the weird stuff

PHEMI is a privacy-preserving data lake for all of your data. Our schema-on-read model lets you approach your data without bias. Structured data? Check the box. Unstructured? Sure thing. Specialized data sets like genomic data? You bet—it’s in our DNA.

Automated versioning and governance

The DevOps movement taught the world the value of automating toolchains. If you take out the redundant tasks, developers can focus on delivering value, hundreds of times per day. Think of PHEMI as DevOps for sensitive data. Acquisition, cataloging, transformation, and analysis—each step is incorporated into an automated, high-performance pipeline that lets your analysts focus on delivering insights. Every step is governed and audited automatically so that only the right people see the right data at the right time, no matter how complex the transform chain.

Data science and analytics tool integration

PHEMI is a privacy-first data lake that integrates with your favorite tools. Love Apache Spark? So do we: PHEMI shows up as a Spark data source you can integrate directly into your Spark jobs. Are you a notebook whiz? PHEMI is with you, page by page. Is your business built on traditional BI tools like Tableau? We have an ODBC connector to support your every query. But rest assured, every attempt to access data is logged and protected under our privacy-preserving analytics model. Stop worrying about data falling into the wrong hands, and get analyzing.

Unlock The Value From Your Big Data

We work across multiple healthcare and financial industries, helping organizations tackle all their big data management challenges with a fully-integrated platform that works on-premise, in the cloud, or even in a hybrid cloud.


Our Clients

Reliable Partners
You Can Trust

We work with industry-leading partners to deliver end-to-end solutions for your most complex Big Data challenges.

Meet our Partners

Latest News Exciting You Tube Channel Coming Soon!

Get The News


Managing metadata is a major component of ...

Start Changing the
World With Your Big Data Today

Download the e-guide
Download the e-guide
Talk To A Data Specialist
Talk To A Data Specialist


Sign Up For Everything Big
Data-Related:Tips, White Papers, Opinion Pieces, Webinar Invitations & News

Email use governed by our Privacy Policy

Sign up for Big Data Newsletter