AWS Kinesis

 AWS Kinesis




AWS Kinesis - 
  • Real-time data steaming service 
  • Used in digest data in real time directly from source 
  • Capacity Modes - 
  • Provisioned -
  • Publishing - 1mb/s per shard or 1000 msg/s per shard 
  • Consuming - 
  • 2mb/s per shard (throughput shared between all consumer)
  • Enhanced Fanout - 2mb/s per shard consumer (dedicated throughput for each consumer )
  • Throughput scales with shards (manual scaling)
  • Pay per shard provisioned per hour 
  • On-demand - 
  • No need to provision or manage the capacity
  •  Default capacity provisioned - 4mb/s or 4000 record/s
  • Scales automatically based on observed throughput peak during the last 30 days 
  • Pay per stream per hour and data in/out per GB

g
  • Not serverless
  • Data retention: 1day (default) to 365 days
  • A record consist of a partition key ( used to partition data coming from multiple publisher) and data blob (max 1mb)
  • Record will be ordered in each shard 
  • Producers SDK, kinesis producer library (KPL), or kinesis agent to publish records
  • Once data inserted in kinesis, it can't me modified or deleted 
  • Ability to reprocess (replay) data


Kinesis Data FireHouse -  
  • Used to load streaming data into target location
  • Writes data in batches efficiently (near real time) - 
  • Buffer size (size of the batch ) - 1 mb to 128 mb (default 5mb)
  • Buffer interval (how long to wait for buffer to fill up) - 60s to 900s (default 300s)
  • Greater the buffer size, higher the write efficiency, longer it will take to fill the buffer 
  • Can ingest data in real time directly from source 
  • Auto scaling
  • Serverless
  •  Pay for data going through firehouse (no provisioning)
  • Supports custom data transformation using lamda function  
  • No replay capabilities (does not store data like KDS)




Kinesis Data Analytics - 
  • Perform real-time analytics on kinesis streams using SQL 
  • Creates streams from SQL query response
  • Cannot ingest data directly from source (ingest data from KDS or KDF)
  • Auto scaling
  • Serverless
  • Pay for the data processed (no provision)
  • Use Cases - 
  • Time-series analytics
  • Real-time dashboard 
  • Real-time metrics 



Kinesis Video Streams - 
  • Capture, process, and store video streams


Comments

Popular posts from this blog

AWS Instance Store

AWS Identity and Access Management

Elastic Block Storage (EBS)