Data Warehousing
  • Data Warehousing
  • Readme
  • Fundamentals
    • Terms to Know
    • Jobs
    • Skills needed for DW developer
    • Application Tiers
    • Operational Database
    • What is a Data Warehouse
      • Typical Data Architecture
      • Problem Statement
      • Features of Data Warehouse
      • Need for Data Warehouse
      • Current State of the Art
    • Activities of Data Science
    • Types of Data
    • Data Storage Systems
    • Data Warehouse 1980 - Current
    • Data Warehouse vs Data Mart
    • Data Warehouse Architecture
      • Top-Down Approach
      • Bottom-Up Approach
    • Data Warehouse Characteristic
      • Subject Oriented
      • Integrated
      • Time Variant
      • Non Volatile
    • Tools
    • Cloud vs On-Premise
    • Steps to design a Data Warehouse
      • Gather Requirements
      • Environment
      • Data Modeling
      • Choosing ETL / ELT Solution
      • Online Analytic Processing
      • Front End
      • Query Optimization
    • Dataset Examples
    • Thoughts on some data
  • RDBMS
    • Data Model
      • Entity Relationship Model
      • Attributes
      • Keys
      • Transaction
      • ACID
    • Online vs Batch
    • DSL vs GPL
    • Connect to Elvis
    • SQL Concepts
      • Basic Select - 1
      • Basic Select - 2
      • UNION Operators
      • Wild Cards & Distinct
      • Group By & Having
      • Sub Queries
      • Derived Tables
      • Views
    • Practice using SQLBolt
  • Cloud
    • Overview
    • Types of Cloud Services
    • Challenges of Cloud Computing
    • AWS
      • AWS Global Infrastructure
      • EC2
      • S3
      • IAM
    • Terraform
  • Spark - Databricks
    • Storage Formats
    • File Formats
    • Medallion Architecture
    • Delta
  • Data Warehousing Concepts
    • Dimensional Modelling
      • Star Schema
      • Galaxy Schema
      • Snowflake Schema
      • Starflake Schema
      • Star vs Snowflake
      • GRAIN
      • Multi-Fact Star Schema
      • Vertabelo Tool
    • Dimension - Fact
    • Sample Excercise
    • Keys
      • Why Surrogate Keys are Important
    • More Examples
    • Master Data Management
    • Steps of Dimensional Modeling
    • Types of Dimensions
      • Date Dimension Table
      • Degenerate Dimension
      • Junk Dimension
      • Static Dimension
      • Conformed Dimensions
      • Slowly Changing Dimensions
        • SCD - Type 0
        • SCD - Type 1
        • SCD - Type 2
        • SCD - Type 3
        • SCD - Type 4
        • SCD - Type 6
        • SCD - Type 5 - Fun Fact
      • Role Playing Dimension
      • Conformed vs Role Playing
      • Shrunken Dimension
      • Swappable Dimension
      • Step Dimension
    • Types of Facts
      • Factless Fact Table
      • Transaction Fact
      • Periodic Fact
      • Accumulating Snapshot Fact Table
      • Transaction vs Periodic vs Accumulating
      • Additive, Semi-Additive, Non-Additive
      • Periodic Snapshot vs Additive
      • Conformed Fact
    • Sample Data Architecture Diagram
    • Data Pipeline Models
    • New DW Concepts
Powered by GitBook
On this page
  • Elastic Cloud Compute
  • Tenancy
  1. Cloud
  2. AWS

EC2

Elastic Cloud Compute

Compute: closely related to CPU/RAM

EC2 is a popular service. Other related services are here.

https://aws.amazon.com/products/compute/

EC2 service can be broken down into the following

AMI (Amazon Machine Images)

Instance Types

Instance Purchasing Options

Tenancy

User Data

Storage Options

Security

AMI - Image baseline with an OS, systems, and applications along with any custom configuration.

Custom AMI can also be built with user-defined customizations.

EC2 Instance Types

ECU - EC2 Units

vCPU - virtual CPU

Processor - Type of Processor

AES - Adv encryption system

AVX - adv vector extensions audio /video

Turbo - whether supports turbo boost or not

Purchasing Options

  • On-Demand

  • Reserved

  • Scheduled

  • Spot

  • On-Demand Capacity Reservations

On-Demand

  • It can be launched any time and used for as long as needed.

  • Flat rate.

  • Ideal for development and testing.

Reserved

  • Purchases for a set period for a reduced cost.

  • Complete payment for 1 or 3 year time. Big discount.

  • Partial upfront payment for a smaller discount.

  • No Upfront. The smallest discount is applied.

  • Best suited for long-term, predictable workloads.

Scheduled

  • Reservations on a recurring schedule daily/weekly/monthly.

  • Even if the instance is not used, customers will be charged.

Best suited for scheduled workloads.

Spot Instances

  • Bid for unused EC2 computes resources.

  • No guarantees for a fixed period.

  • Fluctuation of prices based on supply and demand.

  • Chances instance will be taken away if someone bids a higher price.

  • Purchase large EC2 instances at very low prices.

  • Nonpriority tasks.

On-Demand Capacity Reservation

  • Reserve based on different requirements such as AMI, platform, and AZ for a given time.

Tenancy

Shared Tenancy

  • Any available host with required hosts.

  • Multiple customers may use the same host.

Dedicated Instance

  • Hosted on hardware that no other customers can access.

  • Incur additional charges.

  • It may be required to meet compliance.

Dedicated Host

  • Similar to Dedicated Instance

  • Additional visibility and control of the physical host.

  • Allows you to use existing Software license (for example, windows server license purchased outside AWS)

  • Allows using the same host for several instances.

Storage

Persistent Storage

  • Elastic Block Storage (EBS) Volumes / Logically attached via AWS network.

  • Automatically replicated.

  • Encryption is available.

Ephemeral Storage - Local storage

  • Physically attached to the underlying host.

  • When the instance is stopped or terminated, all the data is lost.

  • Rebooting will keep the data intact.

DEMO - Deploy EC2

PreviousAWS Global InfrastructureNextS3

Last updated 2 years ago