Posts

Showing posts from November, 2020

Google Cloud Dataflow

  Unified stream and batch data processing that's serverless, fast, and cost-effective. Benefits  Streaming data analytics with speed Dataflow enables fast, simplified streaming data pipeline development with lower data latency. Simplify operations and management Allow teams to focus on programming instead of managing server clusters as Dataflow’s serverless approach removes operational overhead from data engineering workloads. Reduce total cost of ownership Resource autoscaling paired with cost-optimized batch processing capabilities means Dataflow offers virtually limitless capacity to manage your seasonal and spiky workloads without overspending. Key Features Streaming Engine Streaming Engine separates compute from state storage and moves parts of pipeline execution out of the worker VMs and into the Dataflow service back end, significantly improving autoscaling and data latency. Autoscaling Autoscaling lets the Dataflow service automatically choose the appropriate number o...

What is Google Cloud Dataproc

  Dataproc is a fast, easy-to-use, low-cost and fully managed service that lets you run the Apache Spark and Apache Hadoop ecosystem on Google Cloud Platform. Dataproc provisions big or small clusters rapidly, supports many popular job types, and is integrated with other Google Cloud Platform services, such as Cloud Storage and Cloud Logging, thus helping you reduce TCO. Top Alternatives to Google Cloud Dataproc Amazon  EMR. Databricks. Azure HDInsight. Cloudera. Apache Spark for Azure HDInsight. Qubole. Microsoft SQL. Snowflake.

What is Google Compute Engine

infrastructure as a service  (IaaS) Google doc VM infrastructure Alternatives / Competitors Amazon EC2 . DigitalOcean Droplets. Azure  Virtual Machines. Linodes Standard. Rackspace Technology. Hostwinds. OpenStack . CloudStack.

What is Google Kubernetes Engine (GKE)

 Container cluster management Alternatives / Competitors Amazon Elastic Container Service (Amazon ECS) Mirantis  Kubernetes Engine  (formerly  Docker  Enterprise) Red Hat OpenShift Container Platform. Kubernetes . Azure  Kubernetes  Service (AKS) IBM Cloud  Kubernetes  Service. AWS Fargate. Amazon Elastic  Kubernetes  Service (Amazon EKS)

What is Google Cloud App Engine

App Engine  Managed app platform Build highly scalable application on a fully managed serverless platform Different Types  App Engine Standard Environment Intended to  Language : Python, Java, Node.js, PHP, Go Experiences sudden and extreme spikes of traffic which requires immediate scaling Instance startup time in second App Engine Flexible environment Go for appengine standard environment when you want less customizations (runtime, vm instances, security, etc. ) and want to plug and play with Google App Engine  Go for appengine flexible environmentwhen you want to use compute engine resources extensively and deploy third party docker components/frameworks within your infrastructure Alternatives / Competitors AWS Lambda . AWS Elastic Beanstalk. Salesforce Heroku. Salesforce Platform. Cloud Foundry . Dokku. Azure Cloud Services . ServiceNow Now Platform.

What is Google Cloud Functions

 Serverless execution environment. Alternatives / Competitors AWS Lambda ,  Google App Engine , Azure Functions, Firebase, and Heroku

Google Cloud Spanner

 Cloud Spanner is a horizontally scalable RDBMS Cloud Spanner supports: ● Automatic replication ● Strong global consistency ● Managed instances with high availability ● SQL (ANSI 2011 with extensions) Cloud Spanner is especially suited for applications requiring: ● A SQL RDBMS, with joins and secondary indexes ● Built-in high availability ● Strong global consistency ● Database sizes exceeding ~2 TB ● Many IOPS (Tens of thousands of reads/writes per second or more) Alternatives / Competitors Amazon Aurora . Oracle Database . IBM Db2 . PostgreSQL . MySQL. Microsoft SQL. SAP HANA . Amazon DynamoDB.

Google Cloud SQL

 Cloud SQL is a managed RDBMS ● Offers MySQL and PostgreSQL databases as a service ● Automatic replication ● Managed backups ● Vertical scaling (read and write) ● Horizontal scaling (read) ● Google security Alternatives / Competitors Microsoft Access Azure SQL Database MySQL MongoDB Atlas Oracle Database Cloud Service  Microsoft SQL Server  Oracle Autonomous Database  Best Google Cloud SQL alternatives for small businesses  InfluxDB  Amazon RDS  Amazon Aurora  IBM Cloudant  Azure SQL Database  Couchbase Couchbase  IBM Cloud Databases

Google Cloud Storage

 Cloud Storage is binary large-object storage  ● High performance, internet-scale ● Simple administration ○ Does not require capacity management ● Data encryption at rest ● Data encryption in transit by default from Google to endpoint ● Online and offline import services are available Example uses of Cloud Storage Transfer Service include: ● Backing up data to a Google Cloud Storage bucket from other storage providers. ● Moving data from a Standard Storage bucket to a Nearline Storage bucket to lower your storage costs. Alternatives / Competitors Amazon  Simple Storage Service (S3) Azure Blob Storage. DigitalOcean Spaces. IBM Cloud Object Storage. Oracle Cloud Infrastructure Object Storage. MinIO . Zadara Storage. Alibaba Object Storage Service.  

Google Cloud Datastore

 Cloud Datastore is a horizontally scalable NoSQL DB  ● NoSQL designed for application backends  ● Fully managed   ○ Uses a distributed architecture to automatically manage scaling  ● Built-in redundancy  ● Supports ACID transactions Cloud Datastore features ● Atomic transactions : Datastore can execute a set of operations where either all succeed, or none occur.  ● High availability of reads and writes : Datastore runs in Google data centers, which use redundancy to minimize impact from points of failure.  ● Massive scalability with high performance : Datastore uses a distributed architecture to automatically manage scaling. Datastore uses a mix of indexes and query constraints so your queries scale with the size of your result set, not the size of your data set.  ● Flexible storage and querying of data : Datastore maps naturally to object-oriented and scripting languages and is exposed to applications through multiple clients. It also prov...

Choose Right Storage Option - GCP

Image
 

Google Cloud Bigtable

Cloud Bigtable is managed NoSQL ● Fully managed NoSQL, wide-column database service for terabyte applications ● Integrated ○ Accessed using HBase API ○ Native compatibility with big data, Hadoop ecosystems Cloud Bigtable is Google's NoSQL big data database service. It's the same database that powers many core Google services, including Search, Analytics, Maps, and Gmail. You can use Cloud Bigtable to store and query all of the following types of data: ● Marketing data, such as purchase histories and customer preferences ● Financial data, such as transaction histories, stock prices, and currency exchange rates ● Internet of Things (IoT) data, such as usage reports from energy meters and home appliances ● Time-series data, such as CPU and memory usage over time for multiple servers ● Personalisation ● Recommendation ●  Geo spatial datasets ● Graphs Why choose Cloud Bigtable? ● Replicated storage ● Data encryption in-flight and at rest ● Role-based ACLs ● Drives major applications...