Security Engineering on AWS - Tips and Tricks
This is my list of hints and tips for this course. It’s markdown so you can save it, access it or store it anywhere. I might also give you other links that are course specific. I’ll add specific answers to questions I get during the course. I’ll share it with everyone.
Your Instructors
- Ian Falconer https://www.linkedin.com/in/leftbrainstuff/
Administrivia
We need to jump through some hoops to get access to the labs, notes and my hints and tips. Be consistent with the email address you use for all sites. There are three seperate sites you need to access and one bitly link which is this page:
- Join or login to https://www.aws.training/ to ensure your training and certifications are captured. No we don’t spam you or sell your details.
- Access Qwiklab (yes it is spelt INCORRECTLY)
- aws.qwiklabs.com for the labs in this class
- run.qwiklabs.com for outside of the class or to do other labs at your own pace. NOTE: Some are free others require course credits. Also check out the AWS Professional Developer Series of MOOCs on edX https://www.edx.org/aws-developer-professional-series
- Access the course notes and slides. You’ll receive two emails. One confirming your attendance at this course and with the following links. The download link seems broken. You can download apps for phones, tablets and laptops. Or use your browser.
- www.vitalsource.com look for a signup link and download link. Or just go to https://evantage.gilmoreglobal.com/#/user/signin
- Once you’ve logged into Vitalsource (aka Bookshelf, Gilmore, eVantage) you can redeem your unique course materials code (in a seperate email) and update your book list. You should see a lab guide and student guide for Security Engineering on AWS, version x.y . The student guide is the powerpoint decks and notes and the lab guide is the step by step instructions for the labs. The lab guide is included in the labs so this document is somewhat redundant. You can download the Vitalsource Bookshelf app for Windows, Mac, IoS and Android at https://support.vitalsource.com/hc/en-us/articles/201344733-Bookshelf-Download-Page
- You can print the student and lab guides to pdf from the app.
Group Exercises (Instructor will decide if and when we tackle these)
Cool links
- James Hamilton, AWS SVP, talks about our infrastructure
- This should be the first AWS video you watch. Here’s the youtube video. https://www.youtube.com/watch?v=JIQETrFC_SQ There are Youtube videos from more recent ReInvents with some updates too. Here is James in 2016. It’s titled as Global Innovation at Scale. https://www.youtube.com/watch?v=uj7Ting6Ckk
- AWS Nitro System is hardware virtualization that has significant benefits over traditional software virtualization. Listen to James Hamilton dive deep on Nitro https://perspectives.mvdirona.com/2019/02/aws-nitro-system/. AWS CISO does a Deep Dive on the Nitro hypervisor and the security benefits of loosely coupling the hypervisor (or more correctly the compute management system and control plane) in a video titled AWS Live re:Inforce - Security Benefits of the EC2 Nitro Architecture https://www.youtube.com/watch?v=t_9CASbagag And Nitro is also described in detail in Amazon EC2 High Memory instances for SAP HANA: simple, flexible, powerful https://aws.amazon.com/blogs/awsforsap/amazon-ec2-high-memory-instances-for-sap-hana-simple-flexible-powerful/
- AWS re:Invent 2017: Scaling Up to Your First 10 Million Users (ARC201). This is like the Tech Essentials course in a single video. Well worth a watch. https://www.youtube.com/watch?v=w95murBkYmU
- A Day in the Life of a Billion Packets" - [http://www.youtube.com/watch?v=Zd5hsL-JNY4 and the sequel Another Day, Another Billion Packets" - [https://www.youtube.com/watch?v=3qln2u1Vr2E
- Amazon EC2 Instance Types explained in neat tabular comparisons. https://aws.amazon.com/ec2/instance-types/ . Also here’s a third party site that has a table that lets you sort on memory, network performance, cost and instance type. You can also quickly compare costs here too. https://ec2instances.info/ . Here’s a stackoverflow thread on non AWS benchmarks of different instance types. https://stackoverflow.com/questions/20663619/what-does-amazon-aws-mean-by-network-performance
- An external post about S3 data leaks. The AWS Shared Responsibility Model is key to avoiding this misconfiguration. https://aws.amazon.com/compliance/shared-responsibility-model/ Engage with your AWS Solution Architects to get your security right. CIA CIO John Edwards has publicly stated that “it’s the best decision we’ve ever made” and “It’s the most innovative thing we’ve ever done” in reference with the CIA’s partnership with AWS. Here are the links. https://fcw.com/articles/2017/06/14/cia-cloud-aws.aspx and https://www.cio.com/article/2375269/hybrid-cloud/cia-off-and-running-with-amazon-web-services.html
- Latency between AWS regions. Lot’s of good empirical data points. Note these are averages over a 24 hour period. https://www.cloudping.co/
- Latency - Lots of interesting links here. http://highscalability.com/latency-everywhere-and-it-costs-you-sales-how-crush-it
- Use the Amazon S3 Transfer Acceleration Speed Comparison tool to check the relative performance of uploading to all S3 regions over the internet and over the AWS internal network by accessing regional edge locations. https://docs.aws.amazon.com/AmazonS3/latest/dev/transfer-acceleration.html
- List of CIDR ranges of AWS regions http://ec2-reachability.amazonaws.com/
- How we handle prime day (DynamoDB) https://www.youtube.com/watch?v=83-IWlvJ__8
- Benchmark tests of EC2 versus other bare metal and cloud servers. AWS keeps innovating relentlessly. https://www.phoronix.com/scan.php?page=article&item=cloud-cpu-36&num=1
- AWS Open Guide on GitHub is a good summary of AWS Documentation https://github.com/open-guides/ We have also open sourced our documentation at https://github.com/awsdocs and the full list of AWS services with their documentation links is at https://docs.aws.amazon.com/index.html#lang/en_us
- You can check the overall health and availability of AWS globally at the Service Health Dashboard (SHD) https://status.aws.amazon.com/ You can also use the AWS Health API to programatically check for service health at https://docs.aws.amazon.com/health/latest/ug/getting-started-api.html
- Netflix have some cool tools that they’ve open sourced. https://netflix.github.io/
- All AWS quickstarts (aka reference architectures) now in github https://github.com/aws-quickstart
- Adrian Cockcroft AWS VP of Cloud Architecture Strategy and former CTO of Netflix. Here’s his Youtube playlist with talks about DevOps, migrations, Netflix lessons learned and digital transformation topics. https://www.youtube.com/playlist?list=PL_KXMLr8jNTnwkzV7SePa0jHFUG2qn0MA
- The Network is Reliable and other fallacies. Key performance and reliability concerns of distributed systems. https://blog.acolyer.org/2014/12/18/the-network-is-reliable/
- Architecture reviews are important. The cloud design principles, here is the 2011 AWS Whitepaper https://media.amazonwebservices.com/AWS_Cloud_Best_Practices.pdf and the Well Architected Review are key inputs. https://aws.amazon.com/architecture/well-architected/
- AWS General Reference. https://docs.aws.amazon.com/general/latest/gr/Welcome.html This document is a key reference when architecting and designing AWS solutions.
- All AWS documentation in one place and neatly categorized. https://docs.aws.amazon.com/index.html?nc2=h_ql_doc#lang/en_us
- Two interesting solutions for transcribing podcasts and creating subtitles
- Implementing Serverless Video Subtitles https://aws.amazon.com/blogs/compute/implementing-serverless-video-subtitles/
- Discovering and indexing podcast episodes using Amazon Transcribe and Amazon Comprehend https://aws.amazon.com/blogs/machine-learning/discovering-and-indexing-podcast-episodes-using-amazon-transcribe-and-amazon-comprehend/
- A great way to understand how Cloudformation can build, update and delete immutable or mutable environments is to reverse engineer AWS Quickstarts (gold standard reference architectures). Check out https://aws.amazon.com/quickstart/saas/identity-with-cognito/ for the deployment guide and https://github.com/aws-quickstart/saas-identity-cognito for all the Cloudformation templates.
- Building mobile apps fast on AWS re Real-Time Offline Ready Chat App written with GraphQL, AWS AppSync, & AWS Amplify https://github.com/aws-samples/aws-appsync-chat
- Turbot is a novel way to manage security controls and permission mappings across multiple cloud and multiple environments including onprem and SaaS offerings. The Turbot team originally built the Johnson and Johnson xbot system to manage multiple account permissions. https://turbot.com/
- Top Security Links of 2019 https://aws.amazon.com/blogs/security/top-11-posts-during-2019/
Security Hardening and Testing
- Test your network, router, website or email hardening. https://internet.nl/test-connection/ for connection test including dnssec and ipv6. Can test websites, routers and email addys. Very difficult to get a 100% pass
DevOps and Agile
- What is DevOps?
- Agile manifesto https://agilemanifesto.org/ 4 behaviours and 12 principles
- Modern summary of agile and DevOps https://gist.github.com/jpswade/4135841363e72ece8086146bd7bb5d91
- DevOps tools
- Summary of the major CICD tools and frameworks https://hostadvice.com/blog/devops-toolbox-jenkins-ansible-chef-puppet-vagrant-saltstack/ including the The SUSE DevOps Framework (aka DevOps Tooling Quagmire)
- Orchestration tooling in lieu of DevOps CICD tooling https://blog.gruntwork.io/why-we-use-terraform-and-not-chef-puppet-ansible-saltstack-or-cloudformation-7989dad2865c
Ian’s list of links for weekly review of all stuff AWS. (trying to keep up with the firehose)
- Search for AWS and Deep Dive or Ninja and you’ll find lots of great videos or slideshares.
- AWS is continually sharing more good stuff on github. https://github.com/aws
- AWS What’s New: https://aws.amazon.com/new/
- And my favourite the weekly AWS Podcast https://aws.amazon.com/podcasts/aws-podcast/
- The first place to start on any of AWS’ more than 125 services is the the faq pages for each product (this is where I start reading) https://aws.amazon.com/faqs/
https://aws.amazon.com/podcasts/aws-podcast/ and all the faq pages for each product (this is where I start reading)
AWS has released a number of webinars and now has a monthly cadence https://aws.amazon.com/about-aws/events/monthlywebinarseries/
AWS Answers is now available to the public. It contains some interesting links. https://aws.amazon.com/answers/
Get to know your Technical Account Manager (TAM) The TAMs provide support for your applications running on AWS. They can help you prepare for major events like testing and scaling. They can also help troubleshoot and provide visibility into AWS infrastructure metrics for troubleshooting. https://aws.amazon.com/premiumsupport/faqs/
AWS Glossary contains service names and nomenclature https://docs.aws.amazon.com/general/latest/gr/glos-chap.html
More useful links
Centralized Logging – AWS Answers | https://aws.amazon.com/answers/logging/centralized-logging/
AWS Developer Forums: Discussion Forums | https://forums.aws.amazon.com/index.jspa
Amazon Web Services - Labs · GitHub | https://github.com/awslabs
GitHub - awslabs/aws-shell: An integrated shell for working with the AWS CLI. | https://github.com/awslabs/aws-shell
Region Table | https://aws.amazon.com/about-aws/global-infrastructure/regional-product-services/
AWS Regions and Endpoints - Amazon Web Services | https://docs.aws.amazon.com/general/latest/gr/rande.html
AWS Service Limits - Amazon Web Services | https://docs.aws.amazon.com/general/latest/gr/aws_service_limits.html
AWS IP Address Ranges - Amazon Web Services | https://docs.aws.amazon.com/general/latest/gr/aws-ip-ranges.html
Error Retries and Exponential Backoff in AWS - Amazon Web Services | https://docs.aws.amazon.com/general/latest/gr/api-retries.html
Cloud Solutions by Application - Amazon Web Services (AWS) | https://aws.amazon.com/solutions/
AWS - Application Architecture Center | https://aws.amazon.com/architecture/
AWS Simple Icons | https://aws.amazon.com/architecture/icons/
Compliance Programs - Amazon Web Services (AWS) | https://aws.amazon.com/compliance/programs/
Case Studies & Customer Success - Amazon Web Services (AWS) | https://aws.amazon.com/solutions/case-studies
AWS Certification - AWS Cloud Computing Certification Program | https://aws.amazon.com/certification/
Best Practice
- Start with the AWS Cloud Security entry page https://aws.amazon.com/security/ Check out the AWS CISO ReInforce security conference highlights at https://reinforce.awsevents.com/
- How to Encrypt and Decrypt Your Data with the AWS Encryption CLI https://aws.amazon.com/blogs/security/how-to-encrypt-and-decrypt-your-data-with-the-aws-encryption-cli/
- A deep dive on AWS from the US Federal Government titled ‘Secure Network Connections - An Evaluation of the US Trusted Internet Connections Program’ aka ‘AWS Government Handbook - Trusted Network Connections’ Trusted Internet Connections (TIC) initiative https://d0.awsstatic.com/whitepapers/compliance/AWS_Secure_Network_Connections.pdf
- Learn from others. An personal reflection on how best to utilize AWS https://wblinks.com/notes/aws-tips-i-wish-id-known-before-i-started/
- Identity management is a complex topic, challenging to implement and maintain. Deep dive on the AWS Identity and Access Management User Guide https://github.com/awsdocs/iam-user-guide/blob/master/doc_source/index.md and check out the example policies.
- You need to dig deep on the AWS Blog and keep up to date with the latest security enablers like - Aggregate container logs using AWS Container Services launches Fluent Bit Plugins for AWS https://aws.amazon.com/about-aws/whats-new/2019/07/aws-container-services-launches-aws-for-fluent-bit/
- Key design principles for any application involving big data. Cost management opportunities are significant. It’s up to you to exploit them. Here’s a comparison between file types and analytics options that highlights the large cost deltas on offer. It’s titled ‘1.1 Billion Taxi Rides on Amazon Athena’ https://tech.marksblogg.com/billion-nyc-taxi-rides-aws-athena.html
- New logging announcements from ReInforce 2019 the AWS security conference
- Centralized Container Logging with Fluent Bit https://aws.amazon.com/blogs/opensource/centralized-container-logging-fluent-bit/ and the Fluent Bit Cloud Native Log Forwarder https://fluentbit.io/
- Amazon EventBridge – Event-Driven AWS Integration for your SaaS Applications https://aws.amazon.com/blogs/aws/amazon-eventbridge-event-driven-aws-integration-for-your-saas-applications/
Compute and Containers
- James Hamilton on AWS new Arm Gravitron and Inferentia chips. https://perspectives.mvdirona.com/2018/11/aws-inferentia-machine-learning-processor/
- Popquiz - How many EC2 options does AWS provide? Hint - https://perspectives.mvdirona.com/2018/11/aws-inferentia-machine-learning-processor/
- Lambda Execution Context explained. https://docs.aws.amazon.com/lambda/latest/dg/running-lambda-code.html
- Understanding container reuse in Lambda https://aws.amazon.com/blogs/compute/container-reuse-in-lambda/
- Best practice for Lambdas. Especially Java based. https://docs.aws.amazon.com/lambda/latest/dg/best-practices.html and https://docs.aws.amazon.com/lambda/latest/dg/java-programming-model.html
- Lambda cold start comparison https://medium.com/@nathan.malishev/lambda-cold-starts-language-comparison-%EF%B8%8F-a4f4b5f16a62 and this also links to the cheekily titled I’m afraid you’re thinking about AWS Lambda cold starts all wrong https://hackernoon.com/im-afraid-you-re-thinking-about-aws-lambda-cold-starts-all-wrong-7d907f278a4f
- Phillip Gerbe’s posts on EC2 autoscaling
- Instrumenting Kubernetes for Observability using AWS X-Ray and Amazon CloudWatch https://github.com/aws-samples/reinvent2018-dev303-code
- Arun Gupta talking about Corretto and OpenJDK https://www.infoq.com/news/2019/03/amazon-releases-corretto-8?utm_campaign=infoq_content&utm_source=twitter&utm_medium=feed&utm_term=java
- A deep dive on Fargate. Lot’s of feature updates around container orchestration so watch the AWS what’s new for the latest. Here’s a slide share that deep dives on Fargate. https://de.slideshare.net/AmazonWebServices/deep-dive-into-aws-fargate Also see https://docs.aws.amazon.com/AmazonECS/latest/developerguide/AWS_Fargate.html
- Deep dive on EC2 instance metadata https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ec2-instance-metadata.html Here’s a QandA from Github with documentation links to how IAM roles work with EC2 instances. https://github.com/ex-aws/ex_aws/issues/30 NOTE be wary of inadvertently exposing credentials when using HTTP proxies, HTML/CSS validator services, and XML processors that support XML inclusion.
- Self paced 10 hr (2+4+4) Kubernetes workshop. AWS Workshop for Kubernetes https://github.com/aws-samples/aws-workshop-for-kubernetes
- Another nice EKS workshop: https://eksworkshop.com/introduction/ .This workshop covers k8 basics and some cool things such as the k8 dashboard, helm etc.. :-) https://ecsworkshop.com/ and https://eksworkshop.com/
Networking links
- Latency http://highscalability.com/latency-everywhere-and-it-costs-you-sales-how-crush-it
- List of CIDR ranges of AWS regions http://ec2-reachability.amazonaws.com/
- Create private links to AWS using Privatelink https://docs.aws.amazon.com/vpc/latest/userguide/vpce-interface.html
- Task Networking in AWS Fargate (How containers can talk to each other over localhost, how we can create microservices networking and using ALB for abstractin scaling from routing) https://aws.amazon.com/blogs/compute/task-networking-in-aws-fargate/
- Network (aka VPC) Deep Dive from 2015 https://s3-eu-west-1.amazonaws.com/awssummit2015/Slides+and+recordings/Tech+track/Deep+Dive+-+Amazon+VPC.pdf A good first start as you begin to deep dive on AWS networking. NOTE: there have been many enhancements to networking performance and reducing complexity of large enterprise networks on AWS over the last few years. ENAs, Transit VPCs, etc.
- A technical comparison between API Gateway vs Application Load Balancer—Technical Details from serverless-training.com https://serverless-training.com/articles/api-gateway-vs-application-load-balancer-technical-details/ Includes working examples. There’s also a Reddit titled Invoking Lambda with ALB vs API Gateway.. https://www.reddit.com/r/aws/comments/a1mirw/invoking_lambda_with_alb_vs_api_gateway/
- VPC Traffic Mirroring – Capture & Inspect Network Traffic gives you another, perhaps more flexible approach for monitoring network traffic to and from ENIs. https://aws.amazon.com/blogs/aws/new-vpc-traffic-mirroring/
- https://aws.amazon.com/blogs/security/how-to-remediate-amazon-inspector-security-findings-automatically/.
Database, Caching and Storage links
- AWS blogs are a great reference. Here’s the database blog link https://aws.amazon.com/blogs/database/
- 24 Jul 2017 S3 Rate Request Performance Increase announcement https://aws.amazon.com/about-aws/whats-new/2018/07/amazon-s3-announces-increased-request-rate-performance/ and notice the exponetial scaling possible with multiple prefixes. https://docs.aws.amazon.com/AmazonS3/latest/dev/request-rate-perf-considerations.html but if using sse-kms this service will a limiting factor. https://docs.aws.amazon.com/kms/latest/developerguide/limits.html#requests-per-second-table
- EFS Performance Deep Dive. The documentation is very informative. Read in this order
- Announcing the New Amazon DynamoDB Key Diagnostics Library | https://aws.amazon.com/about-aws/whats-new/2018/12/announcing-the-new-amazon-dynamodb-key-diagnostics-library/
- DynamoDB Point in Time Restore
- Start here https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/PointInTimeRecovery_Howitworks.html max 4 concurrent restores any time in the last 35 days. There are post restore actions which may need to be manually completed.
- Now consider your specific restore case https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/backuprestore_HowItWorks.html#backuprestore_HowItWorks-restore
- Backups are asynchronous and available for restore in minutes
- You need to rebuild things like Cloudwatch alarms, tags, Auto Scaling, etc
- Restores don’t impact throughput or API performance but you can only write to the restored table once it is active
- It can take up to 20 minutes to restore a table (or longer if your data is skewed). For partitions with billions of items full table restore should take less than 10 hours. Assuming even data distribution across partitions
- Elasticache comparison Memcached and Redis https://docs.aws.amazon.com/AmazonElastiCache/latest/mem-ug/SelectEngine.html
- Loss of caching nodes could be due to underlying hardware failure, reboots or restarts during a maintenance window or even from the loss of an AZ. https://docs.aws.amazon.com/AmazonElastiCache/latest/UserGuide/FaultTolerance.html
- AWS CLI S3 documentation https://docs.aws.amazon.com/cli/latest/topic/s3-config.html#addressing-style
- Everything you ever wanted to know about the Amazon DynamoDB console but were afraid to ask: A detailed walkthrough https://aws.amazon.com/blogs/database/everything-you-ever-wanted-to-know-about-the-amazon-dynamodb-console-but-were-afraid-to-ask-a-detailed-walkthrough/
- Amazon S3 Path Deprecation Plan – The Rest of the Story https://aws.amazon.com/blogs/aws/amazon-s3-path-deprecation-plan-the-rest-of-the-story/
- Rick Houlihan on AWS re:Invent 2018: Building with AWS Databases: Match Your Workload to the Right Database (DAT301) https://www.youtube.com/watch?v=hwnNbLXN4vA&t=19s and here is a discussion about the Rick’s PIE theorem as being more useful in architecting cloud hyper scale databases. It’s titled ‘Why the PIE theorem is more relevant than the CAP theorem’ https://www.alexdebrie.com/posts/choosing-a-database-with-pie/
- Amazon S3 bucket and object permissions are independent. https://docs.aws.amazon.com/AmazonS3/latest/user-guide/set-permissions.html
- Can you have S3 objects with an infinite number of versions? Well now and here’s why. That would also be not cost optimized. https://docs.aws.amazon.com/AmazonS3/latest/dev/ObjectVersioning.html
- Updates or Puts to DynamoDB table not showing up in your GSI? This can happen if your write is missing attributes that the GSI is expecting to be projected. See Global Secondary Indexes documentation (first section) for an explanation. https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/GSI.html
- AWS Big Data course is a broad look at big data services. The Data Warehousing on AWS Course is a deep dive on AWS Redshift and other big data services with a focus on tuning and optimization. https://aws.amazon.com/training/course-descriptions/data-warehousing/
- Some Redshift and ETL blog posts complete with Cloudformation templates. Found these on the big data AWS Blog post
- Orchestrating an ETL process using AWS Step Functions for Amazon Redshift https://aws.amazon.com/blogs/big-data/orchestrating-an-etl-process-using-aws-step-functions-for-amazon-redshift/
- Granting fine-grained access to the Amazon Redshift Management Console https://aws.amazon.com/blogs/big-data/granting-fine-grained-access-to-the-amazon-redshift-management-console/ (No Cloudformation but IAM policy examples)
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark https://aws.amazon.com/blogs/big-data/optimizing-downstream-data-processing-with-amazon-kinesis-data-firehose-and-amazon-emr-running-apache-spark/ Contains 6, yes 6 Cloudformation templates deployed serially.
IAM
- Configuring the AWS CLI https://docs.aws.amazon.com/cli/latest/userguide/cli-chap-configure.html#config-settings-and-precedence
- IAM example policies https://docs.aws.amazon.com/IAM/latest/UserGuide/access_policies_examples.html and policy evaluation logic https://docs.aws.amazon.com/IAM/latest/UserGuide/reference_policies_evaluation-logic.html
- Deep dive on EC2 instance metadata https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/ec2-instance-metadata.html Here’s a QandA from Github with documentation links to how IAM roles work with EC2 instances. https://github.com/ex-aws/ex_aws/issues/30 NOTE be wary of inadvertently exposing credentials when using HTTP proxies, HTML/CSS validator services, and XML processors that support XML inclusion.
Streaming
- FluentD versus Flume https://www.slideshare.net/treasure-data/fluentd-loves-mongodb-at-mongosv-july172012/37-Fluentd_vs_Flume_Easy_to and https://www.slant.co/versus/959/960/~fluentd_vs_flume
- 2014 Kinesis deep dive https://www.youtube.com/watch?v=8u9wIC1xNt8&feature=youtu.be&t=104.+%28AWS+21%29+AWS.+Developing+on+AWS+2.4+%28EN%29%3A+Instructor+Guide.+AWS%2FGilmore.+VitalBook+file and something newer https://www.youtube.com/watch?v=IXcs_e0oTKE
- Kinesis Firehose buffer time configuration https://aws.amazon.com/kinesis/data-firehose/faqs/
- Waiting for a stream to become active and adding some robustness when creating a stream. https://docs.aws.amazon.com/streams/latest/dev/kinesis-using-sdk-java-create-stream.html#kinesis-using-sdk-java-create-the-stream
- For Kinesis Firehose I suggest getting familiar with the documentation. This is a fully managed service and it is designed to be easy to use and highly scalable. This service, like most, has service specific service limits that you need to design to exploit.
- Kinesis Firehose Developer Guide https://docs.aws.amazon.com/kinesis/?id=docs_gateway specifically
- What Is Amazon Kinesis Data Firehose? https://docs.aws.amazon.com/firehose/latest/dev/what-is-this-service.html
- Amazon Kinesis Data Firehose Limits in the developer guide have specific guidance. For example Kinesis Firehose supports Classic Load Balancer and not ALB or NLB (yet?), Redshift clusters must be in a publically accessible cluster and buffering limits vary between Redshift and Elasticsearch. There are other service specific service limits but I’ll let you find those yourself.
Queues and Messaging
- SQS and SNS SLAs. Personal experience demonstrates that SQS is a very reliable service. I can’t recall any customer or personal issues with SQS or SNS over 9+ years. However this anecdotal information is useless for others. Here’s a stackoverflow thread on this topic presumably contributed to by ex AWS folks. https://stackoverflow.com/questions/30750033/amazon-sns-delivery-retry-policies-for-sqs
- as of Jan 2019 SQS and SNS have a 99.9% SLA https://aws.amazon.com/messaging/sla/
Serverless
- Lambda videos to watch
- 2014 ReInvent Lambda. Some early best practices https://www.youtube.com/watch?v=UFj27laTWQA
-
- Probably pass on this one. https://www.youtube.com/watch?v=pBLdMCksM3A
- 2016 Lambda mobile support. https://www.youtube.com/watch?v=copO_JQQsBs
- For a deeper dive into Lambda
- AWS re:Invent 2018: A Serverless Journey: AWS Lambda Under the Hood (SRV409-R1) https://www.youtube.com/watch?v=QdzV04T_kec
- Lambda Internals: Exploring AWS Lambda (2 parts) https://epsagon.com/blog/lambda-internals-exploring-aws-lambda/ and https://epsagon.com/blog/lambda-internals-part-two/
- I’m afraid you’re thinking about AWS Lambda cold starts all wrong https://hackernoon.com/im-afraid-you-re-thinking-about-aws-lambda-cold-starts-all-wrong-7d907f278a4f
- It’s key to use the principles of loose coupling with Lambda functions. Error handling, wait dependencies are better handled with logic outside of individual Lambda functions. Consider using AWS Step Functions to coordinate your Lambdas and business logic. Some examples to help introduce good patterns are:
- Error Processor Sample Application for AWS Lambda https://docs.aws.amazon.com/lambda/latest/dg/sample-errorprocessor.html?icmpid=docs_lambda_landingpage
- Discovering and indexing podcast episodes using Amazon Transcribe and Amazon Comprehend https://aws.amazon.com/blogs/machine-learning/discovering-and-indexing-podcast-episodes-using-amazon-transcribe-and-amazon-comprehend/
- Here is a hack to suspend a Lambda function and then resume. This is definitely an anti pattern and better handled using AWS Step Functions https://medium.com/@galbashan1/aws-lambda-internals-part-2-going-deeper-1e12b9d2515f . While you could do this it is not elegant and could potentially impact the service more broadly. I provide this to highlight the ‘different thinking’ needed to efficiently ’exploit’ AWS services. Loose coupling is a key architectural principle in architecting for cloud and is definitely the approach you should take for serverless and ephemeral functions.
- Here’s a Lambda deep dive which more clearly explains some of the questions around managing state, retries, testing and handling large data sets with Lambda. https://www.youtube.com/watch?v=dB4zJk_fqrU
- Firecracker Announcement (circa Nov 2018) Firecracker – Lightweight Virtualization for Serverless Computing (Secure and fast microVMs for serverless computing and containers). Think of firecracker as next generation ‘fabric’ to replace legacy compute underlying containers, Lambda and edge computing. Firecracker also brings first class security to containers. https://aws.amazon.com/blogs/aws/firecracker-lightweight-virtualization-for-serverless-computing/ and here’s the github page https://firecracker-microvm.github.io/ and the Firecracker documentation https://github.com/firecracker-microvm/firecracker
- Best Practices for Working with AWS Lambda Functions https://docs.aws.amazon.com/lambda/latest/dg/best-practices.html
- Best Practices for Amazon SQS ( SQS and SNS are key services for building loosely coupled architectures) https://docs.aws.amazon.com/AWSSimpleQueueService/latest/SQSDeveloperGuide/working-with-messages.html
Security
- Neat graphic and detailed description of signature version 4 signing of authenticating requests. https://docs.aws.amazon.com/AmazonS3/latest/API/sigv4-query-string-auth.html information about signature version 2 (generally deprecated and less preferred than signature version 4) https://docs.aws.amazon.com/AmazonS3/latest/dev/RESTAuthentication.html
- Deep Dive on the Nitro hypervisor and the security benefits of loosely coupling the hypervisor (or more correctly the compute management system and control plane) in a video titled AWS Live re:Inforce - Security Benefits of the EC2 Nitro Architecture https://www.youtube.com/watch?v=t_9CASbagag And Nitro is also described in detail in Amazon EC2 High Memory instances for SAP HANA: simple, flexible, powerful https://aws.amazon.com/blogs/awsforsap/amazon-ec2-high-memory-instances-for-sap-hana-simple-flexible-powerful/
- AWS Key Management Service Cryptographic Details explains many of the encryption details about the myriad of encryption options available on AWS. https://d0.awsstatic.com/whitepapers/KMS-Cryptographic-Details.pdf
- Security in depth is how you handle DDOS. 1.7 Tbps DDoS Attack — Memcached UDP Reflections Set New Record https://thehackernews.com/2018/03/ddos-attack-memcached.html
- KMS links
- How Key State Affects Use of a Customer Master Key https://docs.aws.amazon.com/kms/latest/developerguide/key-state.html
- KMS Best Practices Whitepaper, a recommended read, https://d1.awsstatic.com/whitepapers/aws-kms-best-practices.pdf
- AWS Key Management Service Cryptographic Details https://d0.awsstatic.com/whitepapers/KMS-Cryptographic-Details.pdf
- Managing Keys in AWS CloudHSM https://docs.aws.amazon.com/cloudhsm/latest/userguide/manage-keys.html
One Line List of services
Entry page for AWS Documentation https://docs.aws.amazon.com/index.html#lang/en_us
165 services as of Mar 2019 according to Andy Jassy https://www.cnbc.com/2019/02/28/amazon-cloud-ceo-we-have-a-30-billion-run-rate-in-our-early-stages.html
# | Name | Description |
---|---|---|
1 | Alexa for Business | Empower your Organization with Alexa |
2 | Amazon API Gateway | “Build, Deploy, and Manage APIs” |
3 | Amazon AppStream 2.0 | Stream Desktop Applications Securely to a Browser |
4 | Amazon Athena | Query Data in S3 using SQL |
5 | Amazon Aurora | High Performance Managed Relational Database |
6 | Amazon Chime | “Frustration-free Meetings, Video Calls, and Chat” |
7 | Amazon Cloud Directory | Create Flexible Cloud-native Directories |
8 | Amazon CloudFront | Global Content Delivery Network |
9 | Amazon CloudSearch | Managed Search Service |
10 | Amazon CloudWatch | Monitor Resources and Applications |
11 | Amazon Cognito | Identity Management for your Apps |
12 | Amazon Comprehend | Discover Insights and Relationships in Text |
13 | Amazon Connect | Cloud-based Contact Center |
14 | Amazon DynamoDB | Managed NoSQL Database |
15 | Amazon EBS | Block Storage for EC2 |
16 | Amazon EC2 | Virtual Servers in the Cloud |
17 | Amazon EC2 Auto Scaling | Scale Compute Capacity to Meet Demand |
18 | Amazon Elastic Container (ECS) Registry | Store and Retrieve Docker Images |
19 | Amazon Elastic Container Service (ECS) | Run and Manage Docker Containers |
20 | Amazon Elastic Container Service for Kubernetes (EKS) | Run Managed Kubernetes on AWS |
21 | Amazon Elastic File System | Managed File Storage for EC2 |
22 | Amazon Elastic Transcoder | Easy-to-use Scalable Media Transcoding |
23 | Amazon ElastiCache | In-memory Caching System |
24 | Amazon Elasticsearch Service | Run and Scale Elasticsearch Clusters |
25 | Amazon EMR | Hosted Hadoop Framework |
26 | Amazon FreeRTOS | IoT Operating System for Microcontrollers |
27 | Amazon GameLift | “Simple, Fast, Cost-effective Dedicated Game Server Hosting” |
28 | Amazon Glacier | Low-cost Archive Storage in the Cloud |
29 | Amazon GuardDuty | Managed Threat Detection Service |
30 | Amazon Inspector | Analyze Application Security |
31 | Amazon Kinesis | Work with Real-time Streaming Data |
32 | Amazon Kinesis Video Streams | Process and Analyze Video Streams |
33 | Amazon Lex | Build Voice and Text Chatbots |
34 | Amazon Lightsail | Launch and Manage Virtual Private Servers |
35 | Amazon Lumberyard | “A Free Cross-Platform 3D Game Engine with Full Source, Integrated with AWS and Twitch” |
36 | Amazon Machine Learning | Machine Learning for Developers |
37 | Amazon Macie | “Discover, Classify, and Protect Your Data” |
38 | Amazon MQ | Managed Message Broker for ActiveMQ |
39 | Amazon Neptune | Fully Managed Graph Database Service |
40 | Amazon Pinpoint | Push Notifications for Mobile Apps |
41 | Amazon Polly | Turn Text into Lifelike Speech |
42 | Amazon Quicksight | Fast Business Analytics Service |
43 | Amazon RDS | “Managed Relational Database Service for MySQL, PostgreSQL, Oracle, SQL Server, and MariaDB” |
44 | Amazon Redshift | “Fast, Simple, Cost-effective Data Warehousing” |
45 | Amazon Rekognition | Analyze Image and Video |
46 | Amazon Route 53 | Scalable Domain Name System |
47 | Amazon S3 | Scalable Storage in the Cloud |
48 | Amazon SageMaker | “Build, Train, and Deploy Machine Learning Models at Scale” |
49 | Amazon Simple Email Service (SES) | Email Sending and Receiving |
50 | Amazon Simple Notification Service (SNS) | “Pub/Sub, Mobile Push and SMS” |
51 | Amazon Simple Queue Service (SQS) | Managed Message Queues |
52 | Amazon Sumerian | Build and Run VR and AR Applications |
53 | Amazon Transcribe | Automatic Speech Recognition |
54 | Amazon Translate | Natural and Fluent Language Translation |
55 | Amazon VPC | Isolated Cloud Resources |
56 | Amazon WorkDocs | Enterprise Storage and Sharing Service |
57 | Amazon WorkMail | Secure and Managed Business Email and Calendaring |
58 | Amazon WorkSpaces | Desktop Computing Service |
59 | Apache MXNet on AWS | “Scalable, High-performance Deep Learning” |
60 | AWS Application Discovery Service | Discover On-Premises Applications to Streamline Migration |
61 | AWS AppSync | Real-time and Offline Mobile Data Apps |
62 | AWS Auto Scaling | Scale Multiple Resources to Meet Demand |
63 | AWS Batch | Run Batch Jobs at Any Scale |
64 | AWS Certificate Manager | “Provision, Manage, and Deploy SSL/TLS Certificates” |
65 | AWS Cloud9 | “Write, Run, and Debug Code on a Cloud IDE” |
66 | AWS CloudFormation | Create and Manage Resources with Templates |
67 | AWS CloudHSM | Hardware-based Key Storage for Regulatory Compliance |
68 | AWS CloudTrail | Track User Activity and API Usage |
69 | AWS CodeBuild | Build and Test Code |
70 | AWS CodeCommit | Store Code in Private Git Repositories |
71 | AWS CodeDeploy | Automate Code Deployment |
72 | AWS CodePipeline | Release Software using Continuous Delivery |
73 | AWS CodeStar | Develop and Deploy AWS Applications |
74 | AWS Command Line Interface | Unified Tool to Manage AWS Services |
75 | AWS Config | Track Resource Inventory and Changes |
76 | AWS Data Pipeline | “Orchestration Service for Periodic, Data-driven Workflows” |
77 | AWS Database Migration Service | Migrate Databases with Minimal Downtime |
78 | AWS Deep Learning AMIs | Quickly Start Deep Learning on EC2 |
79 | AWS DeepLens | Deep Learning Enabled Video Camera |
80 | AWS Device Farm | “Test Android, FireOS, and iOS Apps on Real Devices in the Cloud” |
81 | AWS Direct Connect | Dedicated Network Connection to AWS |
82 | AWS Directory Service | Host and Manage Active Directory |
83 | AWS Elastic Beanstalk | Run and Manage Web Apps |
84 | AWS Elemental MediaConvert | Convert File-based Video Content |
85 | AWS Elemental MediaLive | Convert Live Video Content |
86 | AWS Elemental MediaPackage | Video Origination and Packaging |
87 | AWS Elemental MediaStore | Media Storage and Simple HTTP Origin |
88 | AWS Elemental MediaTailor | Video Personalization and Monetization |
89 | AWS Fargate | Run Containers without Managing Servers or Clusters |
90 | AWS Glue | Prepare and Load Data |
91 | AWS Greengrass | “Local Compute, Messaging, and Sync for Devices” |
92 | AWS Identity and Access Management | Manage User Access and Encryption Keys |
93 | AWS IoT 1-Click | One Click Creation of an AWS Lambda Trigger |
94 | AWS IoT Analytics | Analytics for IoT Devices |
95 | AWS IoT Button | Cloud Programmable Dash Button |
96 | AWS IoT Core | Connect Devices to the Cloud |
97 | AWS IoT Device Defender | Security Management for IoT devices |
98 | AWS IoT Device Management | “Onboard, Organize, and Remotely Manage IoT Devices” |
99 | AWS Key Management Service | Managed Creation and Control of Encryption Keys |
100 | AWS Lambda | Run your Code in Response to Events |
101 | AWS Migration Hub | Track Migrations from a Single Place |
102 | AWS Mobile Hub | “Build, Test, and Monitor Apps” |
103 | AWS Mobile SDK | Mobile Software Development Kit |
104 | AWS OpsWorks | Automate Operations with Chef and Puppet |
105 | AWS Organizations | Policy-based Management for Multiple AWS Accounts |
106 | AWS Personal Health Dashboard | Personalized View of AWS Service Health |
107 | AWS Server Migration Service | Migrate On-Premises Servers to AWS |
108 | AWS Serverless Application Repository | “Discover, Deploy, and Publish Serverless Applications” |
109 | AWS Service Catalog | Create and Use Standardized Products |
110 | AWS Shield | DDoS Protection |
111 | AWS Single Sign-On | Cloud Single Sign-On (SSO) Service using SAML and a user portal for multiple accounts and applications |
112 | AWS Snowball | Petabyte-scale Data Transport |
113 | AWS Snowball Edge | Petabyte-scale Data Transport with On-board Compute |
114 | AWS Snowmobile | Exabyte-scale Data Transport |
115 | AWS Step Functions | Coordinate Distributed Applications |
116 | AWS Storage Gateway | Hybrid Storage Integration also available as hardware from amazon.com |
117 | AWS Systems Manager | Gain Operational Insights and Take Action |
118 | AWS Trusted Advisor | Optimize Performance and Security |
119 | AWS WAF | Filter Malicious Web Traffic |
120 | AWS X-Ray | Analyze and Debug Your Applications |
121 | Elastic Load Balancing | High Scale Load Balancing |
122 | TensorFlow on AWS | Open-source Machine Intelligence Library |
123 | VMware Cloud on AWS | Build a Hybrid Cloud without Custom Hardware |
124 | Secrets Manager | Easily rotate, manage, and retrieve database credentials, API keys, and other secrets through their lifecycle |
125 | AWS Firewall Manager | Centrally configure and manage firewall rules across accounts and applications |
126 | AWS Answers (AWS Answers contains solutions and is not a service per se) | Clear answers to common questions about architecting, building, and running applications on the Amazon Web Services Cloud |
127 | Amazon Kinesis Data Firehose | Prepare and load real-time data streams into data stores and analytics tools |
128 | Amazon Kinesis Data Streams | Ingest and process streaming data with custom applications |
129 | Firecracker | virtualization and open source technology that enables service owners to operate secure multi-tenant container-based services by combining the speed, resource efficiency, and performance enabled by containers with the security and isolation offered by traditional VMs |
130 | AWS Outposts | Bring native AWS services, infrastructure, and operating models to virtually any data center, co-location space, or on-premises facility |
131 | Amazon Robomaker | develop, simulate, and deploy intelligent robotics applications at scale |
132 | AWS DeepRacer | Fully autonomous 1/18th scale race car driven by reinforcement learning, 3D racing simulator, and global racing league |
133 | Amazon Forecast | Fully managed service that uses machine learning to highly accurate forecasts |
134 | Amazon FSx for Windows File Server | Fully managed native Microsoft Windows file system that makes it easy to move Windows-based applications that require file storage to AWS |
135 | Amazon Sagemaker NEO | Train models once, and run anywhere |
136 | Amazon Sagemaker Ground Truth | Automate and crowd source the development of ML training datasets |
137 | AWS GroundStation | Communicate with satellites |
138 | Amazon Personalize | Add performant, real-time personalization and recommendations to their applications easily with no ML experience required |
139 | Amazon TimeStream | Fast, Scalable, Fully Managed Time Series Database |
140 | Amazon Managed Streaming for Kafka | Automatically provisions and runs their Apache Kafka clusters and the Apache Zookeeper nodes |
141 | AWS Inferentia (chip) | Machine learning inference chip designed to deliver high performance at low cost |
142 | Amazon Comprehend Medical | Fully managed, highly accurate deep-learning based medical NLP service |
143 | AWS Well-Architected Tool | Review the state of their workloads and compares them to the latest AWS architectural best practices |
144 | Amazon Quantum Ledger Database (QLDB) | Fully managed ledger database that provides a transparent, immutable, and cryptographically verifiable transaction log owned by a central trusted authority |
145 | AWS Managed Blockchain | Build and manage fully managed scalable blockchain network using open source frameworks Hyperledger Fabric and Ethereum |
146 | AWS ThinkBox | Powerful and easy to use render management system |
147 | AWS License Manager | Set rules to manage, discover, and report software license usage |
148 | AWS App Mesh | easily monitor and control communications across microservices applications. |
149 | AWS Cloud Map | Service discovery for cloud resources |
150 | Amazon RDS on VMware | run Amazon RDS managed relational databases in VMware vSphere on-premises data centers |
151 | AWS Lake Formation | Easily set up a secure data lake in days |
152 | Amazon Sagemaker Reinforcement Learning (RL) | Develop reinforcement learning models at scale |
153 | Amazon Cloudwatch Logs Insights | Fast, Interactive Log Analytics |
154 | Amazon DynamoDB Transactions | Transactions provide atomicity, consistency, isolation, and durability (ACID) in DynamoDB |
155 | Amazon DynamoDB On-Demand | Flexible capacity mode for DynamoDB capable of serving thousands of requests per second without capacity planning |
156 | Amazon Comprehend Medical | natural language processing service to extract relevant medical information from unstructured text |
157 | AWS IoT SiteWise | collect and organize your data from industrial equipment at scale |
158 | AWS IoT Things Graph | connect devices and web services to build IoT applications |
159 | AWS Amplify Console | continuous deployment and hosting service for modern web applications with serverless backends |
160 | AWS Global Accelerator | network layer service (static ip) that you can deploy in front of your internet applications to improve the availability and performance for your globally-distributed user base |
161 | AWS Transit Gateway | connect thousands of Amazon Virtual Private Clouds (VPCs) and their on-premises networks using a single gateway |
162 | AWS IoT Events | detect and respond to events from IoT sensors and applications |
163 | AWS IoT Device Tester | Windows/Linux/Mac test automation tool for connected devices |
164 | AWS DevPay | Deprecated payment service |
165 | AWS Dynamic Training for Deep Learning | open-source deep learning project that allows you to reduce model training cost and time by leveraging the cloud’s elasticity and scale |
166 | AWS Transfer for SFTP | move your file transfer workloads that use the Secure Shell File Transfer Protocol (SFTP) to AWS without needing to modify your applications or manage any SFTP servers |
167 | AWS DataSync | Simplify, Automate, and Accelerate Online Data Transfer between on-premises storage and Amazon S3 or Amazon Elastic File System (Amazon EFS) |
168 | AWS ParallelCluster | Deploy and manage High Performance Computing (HPC) clusters in the AWS cloud |
169 | AWS Resource Manager | Cross account sharing capabilities on Subnets, Transit Gateways and BYOL Manager Configurations with any AWS account or through your AWS Organization |
170 | AWS Client VPN | TLS based secure access to any resource in AWS (EC2, S3, Dynamo DB, etc.) and on-premises from anywhere using OpenVPN based clients |
171 | Amazon Corretto | Free multiplatform and production-ready distribution of the Open Java Development Kit |
172 | AWS Backup | Centrally manage and automate backups across AWS services including EBS, RDS, DynamoDB Tables, EFS and Storage Gateway Volumes |
173 | Amazon DocumentDB | Fast, scalable, highly available MongoDB-compatible database |
174 | Amazon Worklink | Provide secure mobile access to your internal websites and web apps |
175 | AWS Resource Access Manager | AWS RAM enables you to share your resources with any AWS account or organization |
176 | Amazon Data Lifecycle Manager | Create lifecycle policies to automate operations on specified resources |
177 | AWS Tools for PowerShell | PowerShell modules, built on functionality exposed by the AWS SDK for .NET |
178 | AWS CLI | Command line interface for AWS Services |
179 | AWS SDKs | Software development kits for a wide range of languages and protocols |
180 | AWS Silk | Amazon Fire browser Platform |
181 | Amazon Textract | Document text and tabular content detection and analysis |
182 | Amazon Simple Workflow Service (SWF) | Build, run, and scale background jobs that have parallel or sequential steps with state tracking and task coordination |
183 | AWS Security Hub | Comprehensive view of your security state within AWS (Amazon GuardDuty, Amazon Inspector, Amazon Macie and AWS Config) and helps you check your compliance with the security industry standards and best practices. |
184 | AWS Control Tower | Set up and govern your multi-account AWS environment. Uses (AWS Organizations, AWS Service Catalog, AWS Single Sign-on, AWS Config) |
185 | aws-shell | The interactive productivity booster for the AWS CLI |
186 | Amazon Elastic Graphics ( was Amazon EC2 Elastic GPUs ) | Easily and cost-effectively add graphics acceleration to Amazon EC2 Instances |
187 | SimpleDB | Highly available, flexible, and scalable non-relational data store (superseded by DynamoDB but still used internally for EMR) |
188 | AWS Pricing Calculator | Estimate the cost for your architecture solution. Configure a cost estimate that fits your unique business or personal needs with AWS products and services. |
189 | AWS Solutions | Vetted, technical reference implementations designed to help you solve common problems and build faster |
190 | Serverless Application Model (SAM) | extends AWS CloudFormation to provide a simplified way of defining the Amazon API Gateway APIs, AWS Lambda functions, and Amazon DynamoDB tables needed by your serverless application. |
191 | AWS Serverless Application Repository | Discover, deploy, and publish serverless applications |
192 | Amazon CloudWatch Container Insights (in preview) | |
193 | AWS Cloud Development Kit and AWS CDK Tools and AWS Construct Library | open-sourcesoftware development framework to model and provision cloud application resources using familiar programming languages |
194 | AWS for Fluent Bit | |
195 | Amazon EventBridge | A serverless event bus that makes it easy to connect applications together using data from your developers’ applications, Software-as-a-Service (SaaS) applications, and AWS services. EventBridge delivers a stream of real-time data from event sources |
196 | Amazon CloudWatch Anomaly Detection (in preview) | this tool applies machine-learning algorithms to continuously analyze system and application metrics, determine a normal baseline, and surface anomalies with minimal user intervention. |
197 | Amazon SageMaker Managed Spot Training | optimize the cost of training machine learning models using Amazon EC2 spot instances. The enhancement to SageMaker could lower training costs by up to 70% |
Continue reading articles in my Amazon Web Services series