Data Warehousing
  • Data Warehousing
  • Readme
  • Fundamentals
    • Terms to Know
    • Jobs
    • Skills needed for DW developer
    • Application Tiers
    • Operational Database
    • What is a Data Warehouse
      • Typical Data Architecture
      • Problem Statement
      • Features of Data Warehouse
      • Need for Data Warehouse
      • Current State of the Art
    • Activities of Data Science
    • Types of Data
    • Data Storage Systems
    • Data Warehouse 1980 - Current
    • Data Warehouse vs Data Mart
    • Data Warehouse Architecture
      • Top-Down Approach
      • Bottom-Up Approach
    • Data Warehouse Characteristic
      • Subject Oriented
      • Integrated
      • Time Variant
      • Non Volatile
    • Tools
    • Cloud vs On-Premise
    • Steps to design a Data Warehouse
      • Gather Requirements
      • Environment
      • Data Modeling
      • Choosing ETL / ELT Solution
      • Online Analytic Processing
      • Front End
      • Query Optimization
    • Dataset Examples
    • Thoughts on some data
  • RDBMS
    • Data Model
      • Entity Relationship Model
      • Attributes
      • Keys
      • Transaction
      • ACID
    • Online vs Batch
    • DSL vs GPL
    • Connect to Elvis
    • SQL Concepts
      • Basic Select - 1
      • Basic Select - 2
      • UNION Operators
      • Wild Cards & Distinct
      • Group By & Having
      • Sub Queries
      • Derived Tables
      • Views
    • Practice using SQLBolt
  • Cloud
    • Overview
    • Types of Cloud Services
    • Challenges of Cloud Computing
    • AWS
      • AWS Global Infrastructure
      • EC2
      • S3
      • IAM
    • Terraform
  • Spark - Databricks
    • Storage Formats
    • File Formats
    • Medallion Architecture
    • Delta
  • Data Warehousing Concepts
    • Dimensional Modelling
      • Star Schema
      • Galaxy Schema
      • Snowflake Schema
      • Starflake Schema
      • Star vs Snowflake
      • GRAIN
      • Multi-Fact Star Schema
      • Vertabelo Tool
    • Dimension - Fact
    • Sample Excercise
    • Keys
      • Why Surrogate Keys are Important
    • More Examples
    • Master Data Management
    • Steps of Dimensional Modeling
    • Types of Dimensions
      • Date Dimension Table
      • Degenerate Dimension
      • Junk Dimension
      • Static Dimension
      • Conformed Dimensions
      • Slowly Changing Dimensions
        • SCD - Type 0
        • SCD - Type 1
        • SCD - Type 2
        • SCD - Type 3
        • SCD - Type 4
        • SCD - Type 6
        • SCD - Type 5 - Fun Fact
      • Role Playing Dimension
      • Conformed vs Role Playing
      • Shrunken Dimension
      • Swappable Dimension
      • Step Dimension
    • Types of Facts
      • Factless Fact Table
      • Transaction Fact
      • Periodic Fact
      • Accumulating Snapshot Fact Table
      • Transaction vs Periodic vs Accumulating
      • Additive, Semi-Additive, Non-Additive
      • Periodic Snapshot vs Additive
      • Conformed Fact
    • Sample Data Architecture Diagram
    • Data Pipeline Models
    • New DW Concepts
Powered by GitBook
On this page
  • Elastic Cloud Compute
  • Tenancy
  1. Cloud
  2. AWS

EC2

PreviousAWS Global InfrastructureNextS3

Last updated 2 years ago

Elastic Cloud Compute

Compute: closely related to CPU/RAM

EC2 is a popular service. Other related services are here.

EC2 service can be broken down into the following

AMI (Amazon Machine Images)

Instance Types

Instance Purchasing Options

Tenancy

User Data

Storage Options

Security

AMI - Image baseline with an OS, systems, and applications along with any custom configuration.

Custom AMI can also be built with user-defined customizations.

EC2 Instance Types

ECU - EC2 Units

vCPU - virtual CPU

Processor - Type of Processor

AES - Adv encryption system

AVX - adv vector extensions audio /video

Turbo - whether supports turbo boost or not

Purchasing Options

  • On-Demand

  • Reserved

  • Scheduled

  • Spot

  • On-Demand Capacity Reservations

On-Demand

  • It can be launched any time and used for as long as needed.

  • Flat rate.

  • Ideal for development and testing.

Reserved

  • Purchases for a set period for a reduced cost.

  • Complete payment for 1 or 3 year time. Big discount.

  • Partial upfront payment for a smaller discount.

  • No Upfront. The smallest discount is applied.

  • Best suited for long-term, predictable workloads.

Scheduled

  • Reservations on a recurring schedule daily/weekly/monthly.

  • Even if the instance is not used, customers will be charged.

Best suited for scheduled workloads.

Spot Instances

  • Bid for unused EC2 computes resources.

  • No guarantees for a fixed period.

  • Fluctuation of prices based on supply and demand.

  • Chances instance will be taken away if someone bids a higher price.

  • Purchase large EC2 instances at very low prices.

  • Nonpriority tasks.

On-Demand Capacity Reservation

  • Reserve based on different requirements such as AMI, platform, and AZ for a given time.

Tenancy

Shared Tenancy

  • Any available host with required hosts.

  • Multiple customers may use the same host.

Dedicated Instance

  • Hosted on hardware that no other customers can access.

  • Incur additional charges.

  • It may be required to meet compliance.

Dedicated Host

  • Similar to Dedicated Instance

  • Additional visibility and control of the physical host.

  • Allows you to use existing Software license (for example, windows server license purchased outside AWS)

  • Allows using the same host for several instances.

Storage

Persistent Storage

  • Elastic Block Storage (EBS) Volumes / Logically attached via AWS network.

  • Automatically replicated.

  • Encryption is available.

Ephemeral Storage - Local storage

  • Physically attached to the underlying host.

  • When the instance is stopped or terminated, all the data is lost.

  • Rebooting will keep the data intact.

DEMO - Deploy EC2

https://aws.amazon.com/products/compute/