Apache Ranger: Fine-Grained Access Control. In this course, we study end-to-end Apache Ranger authorization policies and integration with Hadoop components like Hive, HDFS, Apache Atlas, Presto, etc.
What you’ll learn
- Apache Ranger Components & Architecture
- Complete Ranger Hive Integration with real usecases
- Understanding end-to-end Ranger HDFS Plugin
- Familiar with Hive and HDFS
- Good to have AWS EMR hands-on
What do you Learn From this Course?
In this course, we study end-to-end Apache Ranger authorization policies and integration with Hadoop components like Hive, HDFS, Apache Atlas, Presto, etc.
We first learn Ranger Components and Architecture.
Next, we install Apache Ranger 2.1.0, which is the latest stable version. Installing the Ranger seems pretty straightforward. However, Ranger requires Solr and Mysql infrastructure. Ranger is compatible with Mysql version 5.6 or 5.7. It will not support beyond these versions. These limitations complicate the ranger installation. I spent many days installing ranger. I consolidated all the steps into a single shell script. Users can download and run it without any hassle. After that, we navigate to the Apache Ranger UI and explore the featured features.
In the third section, we achieve the integration of Ranger Hive. At first, we purchased the hive cluster and installed the ranger hive plugin. We load sample data into Hive. We define policies in apache ranger for different users and we validate the policies by consulting hive. We also looked at masking and row-level filters.
In the fourth section, we delve into the Ranger HDFS integration. At first, we purchased the hive cluster and installed the HDFS Ranger plug-in.
We delved into managed hive tables and loaded sample data into HDFS. Lastly, we define the HDFS policies and verify them by querying the Hive tables.