2 When a principal attempts to run a query in Amazon EMR on data from Lake

2 when a principal attempts to run a query in amazon

This preview shows page 248 - 251 out of 395 pages.

2. When a principal attempts to run a query in Amazon EMR on data from Lake Formation, Amazon EMR requests temporary credentials for data access from AWS Lake Formation. 3. Lake Formation returns temporary credentials, allowing data access. 4. Amazon EMR sends the query request to obtain data from Amazon S3. 5. Amazon EMR filters and returns the results based on the user permissions defined in Lake Formation. Amazon EMR Components Amazon EMR enables fine-grained access control with Lake Formation by using the following components: Proxy agent - The proxy agent is based on Apache Knox. It receives SAML-authenticated requests from users and translates SAML claims to temporary credentials. It also stores the temporary credentials in 242
Image of page 248
Amazon EMR Management Guide Conceptual Overview of Amazon EMR Integration with Lake Formation the secret agent. The proxy agent runs on the master node as the knox system user and writes logs to the /var/log/knox directory. Secret agent - The secret agent securely stores secrets and distributes secrets to other EMR components or applications. The secrets can include temporary user credentials, encryption keys, or Kerberos tickets. The secret agent runs on every node in the cluster and uses Lake Formation and AWS Glue APIs to retrieve temporary credentials and AWS Glue Data Catalog metadata. The secret agent runs as the emrsecretagent user, and writes logs to the /emr/secretagent/log directory. The process relies on a specific set of iptables rules to function. It is important to ensure iptables is not disabled, and, if you customize iptables configuration, the nat table rules must be preserved and left unaltered. Record server - The record server receives requests for accessing data. It then authorizes requests based on temporary credentials and table access control policies distributed by the secret agent. The record server reads data from Amazon S3 and returns column-level data that the user is authorized to access. The record server runs on every node in the cluster as the emr_record_server user and writes logs to the /var/log/emr-record-server directory. Note Spark SQL has been integrated with each of these components, allowing Spark SQL jobs to read and process data that are protected by Lake Formation policies. Architecture of SAML-Enabled Single Sign-On and Fine-Grained Access Control The following diagram illustrates the architecture of SAML-enabled single sign-on and fine-grained access control with Lake Formation and Amazon EMR. 243
Image of page 249
Amazon EMR Management Guide Conceptual Overview of Amazon EMR Integration with Lake Formation 1. An unauthenticated user uses the proxy agent to access EMR notebook or Zeppelin. The user is redirected to your organization’s Identity Provider (IdP) sign-on page. 2. The IdP verifies the user's identity in your organization.
Image of page 250
Image of page 251

You've reached the end of your free preview.

Want to read all 395 pages?

  • Spring '12
  • LauraParker
  • Amazon Web Services, Amazon Elastic Compute Cloud

What students are saying

  • Left Quote Icon

    As a current student on this bumpy collegiate pathway, I stumbled upon Course Hero, where I can find study resources for nearly all my courses, get online help from tutors 24/7, and even share my old projects, papers, and lecture notes with other students.

    Student Picture

    Kiran Temple University Fox School of Business ‘17, Course Hero Intern

  • Left Quote Icon

    I cannot even describe how much Course Hero helped me this summer. It’s truly become something I can always rely on and help me. In the end, I was not only able to survive summer classes, but I was able to thrive thanks to Course Hero.

    Student Picture

    Dana University of Pennsylvania ‘17, Course Hero Intern

  • Left Quote Icon

    The ability to access any university’s resources through Course Hero proved invaluable in my case. I was behind on Tulane coursework and actually used UCLA’s materials to help me move forward and get everything together on time.

    Student Picture

    Jill Tulane University ‘16, Course Hero Intern

Stuck? We have tutors online 24/7 who can help you get unstuck.
A+ icon
Ask Expert Tutors You can ask You can ask ( soon) You can ask (will expire )
Answers in as fast as 15 minutes
A+ icon
Ask Expert Tutors