Data Engineering
Showing 601–612 of 758 results
Run Azure Databricks Notebooks with Azure Data Factory
Using pipelines in Azure Data Factory to run notebooks in Azure Databricks enables you to automate data engineering processes at cloud scale.
Run Petabyte level OSS NoSQL databases with HDInsight HBase
Learn how HBase provides random access and strong consistency for large amounts of unstructured and semi structured data in a schema less database organized by column families.
Scalable Data Processing in R
Learn how to write scalable code for working with big data in R using the bigmemory and iotools packages.
Searching and Manipulating Data in Excel 2016
If you always find yourself encountering pre-filled Excel workbooks, the skill to search and manipulate data within them is crucial. With topics like data validation and what-if analysis, teaching you those skills is what this course aims to achieve.
Secure a data warehouse in Azure Synapse Analytics
Learn how to approach and implement security to protect your data with Azure Synapse Analytics.
Secure Azure Database for PostgreSQL
Azure Database for PostgreSQL includes comprehensive security features including encryption, authentication, and granting permissions to database users. In this module, you learn about the security features of Azure Database for PostgreSQL.
Secure Couchbase 6 Clusters
Security in Couchbase spans a variety of topics, and this course focuses on the most important ones - user authentication and authorization, auditing activities, redacting sensitive data, and encrypting communications.
Secure data and manage users in Azure Synapse serverless SQL pools
Learn how you can set up security when using Azure Synapse serverless SQL pools
Secure MySQL
Learn about security and encryption in Azure Database for MySQL.
Serverless Data Processing with Dataflow: Develop Pipelines
In this second installment of the Dataflow course series, we are going to be diving deeper on developing pipelines using the Beam SDK.
Serverless Data Processing with Dataflow: Foundations
This course is part 1 of a 3-course series on Serverless Data Processing with Dataflow. In this first course, we start with a refresher of what Apache Beam is and its relationship with Dataflow.
Serverless Data Processing with Dataflow: Operations
In the last installment of the Dataflow course series, we will introduce the components of the Dataflow operational model. We will examine tools and techniques for troubleshooting and optimizing pipeline performance.