Hadoop Security Basics (In Under 5 Minutes)

August 9, 2019

Data Basics Catie Grasso

Between growing data and analytics teams, more collaborative projects, and GDPR, security and governance are becoming bigger issues. As organizations' analytical maturity improves, it's increasingly critical that business and technical users alike understand data best practices.

distributed systems hadoop architecture basics

Who is allowed to use what data? And what are they allowed to do with it? Big data brings Hadoop clusters, which use Kerberos and ACLs and LDAPs, all to manage rules and access rights. But what are those exactly, and how do they work?

When working in data and analytics, communicating with IT professionals who use technical vocabulary is commonplace. What is a yarn, and what is a metastore? What is the difference between Sentry and Ranger? (Hint: not much.) What does Apache Knox do? What do we mean by impersonation? And what do Hive, Pig, Impala, Tez, Spark, and MapReduce all have in common?

We've created a glossary with some key terms to think about when discussing Hadoop security.

Hadoop Security Basics (In Under 5 Minutes)

You May Also Like

From Bedside to Backend: Making Sense of Real-World Health Data

Low-Code & No-Code: What’s Real, What’s Hype, & Who Actually Delivers?

AI & Human Connection: Empowering Businesses, Elevating People

[Infographic] 8 Steps to Drive AI Literacy Success

Hadoop Security Basics (In Under 5 Minutes)

Go Further: Data Architecture Crash Course

Subscribe to the Dataiku Blog

You May Also Like

From Bedside to Backend: Making Sense of Real-World Health Data

Low-Code & No-Code: What’s Real, What’s Hype, & Who Actually Delivers?

AI & Human Connection: Empowering Businesses, Elevating People

[Infographic] 8 Steps to Drive AI Literacy Success