Big Data on AWS - Tips and Tricks

This is my list of hints and tips for this course. It’s markdown so you can save it, access it or store it anywhere. I might also give you other links that are course specific. I’ll add specific answers to questions I get during the course. I’ll share it with everyone.

Your Instructor

Administrivia

We need to jump through some hoops to get access to the labs, notes and my hints and tips. Be consistent with the email address you use for all sites. There are three seperate sites you need to access and one bitly link which is this page:

  • Join or login to https://www.aws.training/ to ensure your training and certifications are captured. No we don’t spam you or sell your details.
  • Access Qwiklab (yes it is spelt INCORRECTLY)
    • aws.qwiklabs.com for the labs in this class
    • run.qwiklabs.com for outside of the class or to do other labs at your own pace.
    • NOTE: Some are free others require course credits. Also check out the AWS Professional Developer Series of MOOCs on edX https://www.edx.org/aws-developer-professional-series
  • Access the course notes and slides. You’ll receive two emails. One confirming your attendance at this course and with the following links. The download link seems broken. You can download apps for phones, tablets and laptops. Or use your browser.
  • www.vitalsource.com look for a signup link and download link. Or just go to https://evantage.gilmoreglobal.com/#/user/signin
  • Once you’ve logged into Vitalsource (aka Bookshelf, Gilmore, eVantage) you can redeem your unique course materials code (in a seperate email) and update your book list. You should see a lab guide and student guide for Big Data on AWS, version 3.8 . The student guide is the powerpoint decks and notes and the lab guide is the step by step instructions for the labs. The lab guide is included in the labs so this document is somewhat redundant. You can download the Vitalsource Bookshelf app for Windows, Mac, IoS and Android at https://support.vitalsource.com/hc/en-us/articles/201344733-Bookshelf-Download-Page
  • You can print the student and lab guides to pdf from the app.

Academic papers

Some Fundamentals

Cool links

Best Practice

Migration Best Practice

Networking Links

Compute links

  • aws cli wait command will time out after 120 checks. They’re labled as failed checks in the documentation but they aren’t strictly a failure. Whatever the wait command is waiting on never reaches the wait state if it times out. The timeout period can vary so check the documentation for the service and wait state you’re interested in. Here’s EBS https://docs.aws.amazon.com/cli/latest/reference/ec2/wait/snapshot-completed.html
  • Here’s a useful sortable table of EC2 instance types, sizes and specifications. Not your add columns like available for EMR which makes instance selection very simple. https://ec2instances.info/ . Also check out http://instancetyp.es too.

Serverless Links

Using Amazon SNS and AWS Lambda together in serverless event driven architectures:

Storage Links

Database Links

Streaming links

Athena Links

Redshift Links

EMR Links

DynamoDB links

Glue links

Security

  • ANT346 – [BS] Lock It Down: Configure End-to-End Security & Access Control on Amazon EMR TODO youtube link here

Big Data architectures

Time Series Forecasting

Java Serverless Links

Self paced Learning and Building

Continue reading articles in my Amazon Web Services series