Discover smart ways of handling databases by mastering NoSQL-Cassandra, HBase, MongoDB and Couchbase.
This is a Combo Training Course in NoSQL that includes some of the best NoSQL tools like HBase, MongoDB and Cassandra. This Course provides you hands-on experience of working on real-time projects that are designed as per industry requirements.
Cassandra Course Content
Advantages and Usage of Cassandra
Introduction to Cassandra, its strengths and deployment areas
CAP Theorem and No SQL DataBase
Significance of NoSQL, RDBMS Replication, Key Challenges, types of NoSQL, benefits and drawbacks, salient features of NoSQL database. CAP Theorem, Consistency.
Cassandra fundamentals, Data model, Installation and setup
Installation, introduction to Cassandra, key concepts and deployment of non relational database, column-oriented database, Data Model – column, column family,
Cassandra Configuration
Token calculation, Configuration overview, Node tool, Validators, Comparators, Expiring column, QA
Summarization, node tool commands, cluster, Indexes, Cassandra & MapReduce, Installing Ops-center
How Cassandra modelling varies from Relational database modelling, Cassandra modelling steps, introduction to Time Series modelling, comparing Column family Vs. Super Column family, Counter column family, Partitioners, Partitioners strategies, Replication, Gossip protocols, Read operation, Consistency, Comparison
Multi Cluster setup
Creation of multi node cluster, node settings, Key and Row cache, System Key space, understanding of Read Operation, Cassandra Commands overview, VNodes, Column family
Thrift/Avro/Json/Hector Client
JSON, Hector client, AVRO, Thrift, JAVA code writing method, Hector tag
Datastax installation part,• Secondary index
Cassandra management, commands of node tool, MapReduce and Cassandra, Secondary index, Datastax Installation
Advance Modelling
Rules of Cassandra data modelling, increasing data writes, duplication, and reducing data reads, modelling data around queries, creating table for data queries
Deploying the IDE for Cassandra applications
Understanding the Java application creation methodology, learning key drivers, deploying the IDE for Cassandra applications,cluster connection and data query implementation
Cassandra Administration
Learning about Node Tool Utility, cluster management using Command Line Interface, Cassandra management and monitoring via DataStax Ops Center.
Cassandra API and Summarization and Thrift
Cassandra client connectivity, connection pool internals, API, important features and concepts of Hector client, Thrift, JAVA code, Summarization.
Apache HBASE Course Content
HBase Overview
Getting started with HBase, Core concepts of HBase, Understanding HBase with an Example
Architecture of NoSQL
Why HBase?, Where to use HBase?, What is NoSQL?
HBase Data Modeling
HDFS vs.HBase, HBase Use Cases, Data Modeling HBase
HBase Cluster Components
HBase Architecture, Main components of HBase Cluster
HBase API and Advanced Operations
HBase Shell, HBase API, Primary Operations, Advanced Operations
Integration of Hive with HBase
Create a Table and Insert Data into it, Integration of Hive with HBase, Load Utility
File loading with both load Utility
Putting Folder to VM, File loading with both load Utility
MongoDB Course Content
Introduction to NoSQL and MongoDB
RDBMS, types of relational databases, challenges of RDBMS, NoSQL database, its significance, how NoSQL suits Big Data needs, Introduction to MongoDB and its advantages, MongoDB installation, JSON features, data types and examples.
MongoDB Installation
Installing MongoDB, basic MongoDB commands and operations, MongoChef (MongoGUI) Installation, MongoDB Data types.
Importance of NoSQL
The need for NoSQL, types of NoSQL databases, OLTP, OLAP, limitations of RDBMS, ACID properties, CAP Theorem, Base property, learning about JSON/BSON, database collection & document, MongoDB uses, MongoDB Write Concern – Acknowledged, Replica Acknowledged, Unacknowledged, Journaled, Fsync.
CRUD Operations
Understanding CRUD and its functionality, CRUD concepts, MongoDB Query & Syntax, read and write queries and query optimization.
Data Modeling & Schema Design
Concepts of data modeling, difference between MongoDB and RDBMS modeling, Model tree structure, operational strategies, monitoring and backup.
Data Management & Administration
In this module you will learn MongoDB® Administration activities such as Health Check, Backup, Recovery, database sharding and profiling, Data Import/Export, Performance tuning etc.
Data Indexing and Aggregation
Concepts of data aggregation and types, data indexing concepts, properties and variations.
MongoDB Security
Understanding database security risks, MongoDB security concept and security approach, MongoDB integration with Java and Robomongo.
Working with Unstructured Data
Implementing techniques to work with variety of unstructured data like images, videos, log data, and others, understanding GridFS MongoDB file system for storing data.
Couchbase Course Content
Introduction to Couchbase
The Architecture of Couchbase, understanding Couchbase distributed NoSQL database engine, vBuckets for information distribution on Couchbase cluster, user and system requirements, Couchbase downloading and installation.
Single Node Implementation
Couchbase single node deployment for development purpose
Couchbase Web Console
Managing the Couchbase environment with the Web Console tool, configuring the Couchbase server and management, working with Couchbase data buckets, default bucket sizing, and administration.
Couchbase Multi Node Cluster
Methods for deploying Couchbase in multi node cluster – all Couchbase Servers on one machine and second with each Couchbase Server on own machine.
Couchbase Command-line Interface
The Couchbase Command-line Interface tools for managing and monitoring single node and multi node clusters, Severs and vBuckets, developing Reports for log data collection.
Cassandra Project
Type : Deploying the IDE for Cassandra applications
Topics : This project gives you a hands-on experience in installing and working with Apache Cassandra which is a high performance and extremely scalable database for distributed data with no single point of failure. You will deploy the Java Integrated Development Environment for running Cassandra, learn about the key drivers, work with Cassandra applications in a cluster setup and implement data querying techniques.
Apache HBASE Project
Project 1 – Integrate Hive & Java with HBase
Topics : This is project that gives you hands-on experience to connect Hive and Java with HBase. Hive is used for querying using HiveQL that translates SQL-like queries into MapReduce jobs on Hadoop framework. In this project you will do HBase Installation, create Hive for HBase, import the data onto Hive from HBase, use HiveQL for Hive Table data querying and analyzing, and managing the HBase Table. You will also learn to Integrate Java with HBase to run HBase queries using Java applications that you
MongoDB Project
Java is one of the most popular programming languages for working with MongoDB. This project tells you how to work with the MongoDB Java Driver, and using MongoDB as a Java Developer. Become proficient in creating a table for inserting video using Java programming. Some of the tasks and steps involved are as below–
- Installation of Java
- Setting up MongoDB JDBC Driver
- Connecting to the database
- Understanding about collections and documents
- Reading and writing basics from the database
- Learning about the Java Virtual Machine libraries
Couchbase Project
Topics : This project involves working with the Couchbase command-line interface tools that are used for managing of clusters in a multi-node or single node setup, working with vBuckets in Couchbase server, deploying Reports for log data collection. You will gain hands-on experience in deploying commands like start, stop and report status for log collection. It also includes working with Couchbase-cli, cbcollect_info tool and so on. Upon completion of the project you will be proficient in using Couchbase CLI for managing and monitoring clusters, data replication using XDCR.