Data Engineering
Showing 589–600 of 829 results
Oracle: Oracle Autonomous Database Administration Workshop
Learn the fundamentals of Autonomous databases and take your DBA skills knowledge to the next level and learn to deploy and administer Autonomous databases. Explore planning, implementing, and deploying an Oracle Autonomous Database, the first and only autonomous database service in the cloud.In this module you will learn how to create applications on Autonomous Database using SQL, APEX, and Oracle Machine Learning.
Orchestrate data movement and transformation in Azure Data Factory or Azure Synapse Pipeline
In this module, you will learn how Azure Data Factory can orchestrate large scale data movement by using other Azure Data Platform and Machine Learning technologies.
Organizational Privacy Engineering
Explore organizational approaches to designing privacy-aware systems and frameworks for data governance.
Pandas Joins for Spreadsheet Users
Learn how to effectively and efficiently join datasets in tabular format using the Python Pandas library.
Perform advanced streaming data transformations with Apache Spark and Kafka in Azure HDInsight
In this module, you learn how to create real-time streaming data analytics pipelines and applications on the cloud by using Azure HDInsight with Apache Kafka and Apache Spark.
Perform code-free transformation at scale with Azure Data Factory or Azure Synapse Pipeline
In this module, you will learn how to perform common data transformation and cleansing activities within Azure Data Factory without using code.
Perform data analysis with Azure Databricks
Learn how to perform data analysis using Azure Databricks. Explore various data ingestion methods and how to integrate data from sources like Azure Data Lake and Azure SQL Database. This module guides you through using collaborative notebooks to perform exploratory data analysis (EDA), so you can visualize, manipulate, and examine data to uncover patterns, anomalies, and correlations.
Perform data engineering with Azure Synapse Apache Spark Pools
Apache Spark is a highly scalable distributed processing solution for big data analytics and transformation. You can leverage its power in Azure Synapse Analytics by using Spark pools.
Perform incremental processing with spark structured streaming
You explore different features and tools to help you understand and work with incremental processing with spark structured streaming.
Perform Zero ETL analytics with HDInsight Interactive Query
By the end of this module, you can perform ad hoc queries on a big-data set. Using HDInsight Interactive Query helps to achieve sub second query latencies.
Performing Database Operations in the Cloudant Dashboard
Master performing database operations within the IBM Cloudant Dashboard. Learn to interact with your NoSQL database, manage data, and run queries directly from the dashboard for cloud-based applications.
Performing Table and CRUD Operations with Cassandra
Discover how to perform CRUD operations and work with tables in Cassandra, a NoSQL database designed for scalability. Learn to handle large datasets, query efficiently, and use the Cassandra Query Language (CQL).