Data Integration & ETL with Talend Open Studio Zero to Hero

Add value to your data – with Talend Open Studio for Data Integration, ETL, Data Warehousing, Data Migration, BI

Data. Everywhere. All well-behaved in their own environment. But who actually lets them talk to each other? You do. With data integration. Become a data savant and add value with ETL and your new knowledge!

What you’ll learn

  • connect your data sources, such as files, databases, XML, web services, Google Drive and more.
  • build your own integration processes using practical examples and comprehensive scenarios.
  • master the most important transformations like mappings, joins, aggregations and sorting.
  • orchestrate processes into larger units by using preJobs, postJobs, variables and hierarchies.
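
To give a feel for what joins, aggregations and sorting do to your data, here is a minimal sketch in plain Python. It is not Talend code and not how Talend implements these steps; the sample rows and the function names (`join_orders`, `total_per_customer`, `sort_by_total`) are made up for illustration only.

```python
from collections import defaultdict

# Hypothetical sample data; in a real job these rows would come from files or databases.
orders = [
    {"order_id": 1, "customer_id": "C1", "amount": 120.0},
    {"order_id": 2, "customer_id": "C2", "amount": 80.0},
    {"order_id": 3, "customer_id": "C1", "amount": 40.0},
    {"order_id": 4, "customer_id": "C9", "amount": 15.0},  # no matching customer
]
customers = {"C1": "Alice", "C2": "Bob"}  # lookup table keyed by customer_id


def join_orders(order_rows, customer_lookup):
    """Inner join: enrich matched rows, collect unmatched rows as rejects."""
    matched, rejects = [], []
    for row in order_rows:
        name = customer_lookup.get(row["customer_id"])
        if name is None:
            rejects.append(row)
        else:
            matched.append({**row, "customer": name})
    return matched, rejects


def total_per_customer(rows):
    """Aggregation: sum the order amounts per customer."""
    totals = defaultdict(float)
    for row in rows:
        totals[row["customer"]] += row["amount"]
    return dict(totals)


def sort_by_total(totals):
    """Sorting: highest total first."""
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)


matched, rejects = join_orders(orders, customers)
ranking = sort_by_total(total_per_customer(matched))
print(ranking)  # [('Alice', 160.0), ('Bob', 80.0)]
print(rejects)  # the order for the unknown customer C9
```

In the course you build the same kind of flow graphically: one step joins the main flow with a lookup, the next aggregates, the last sorts, and unmatched rows can be routed to a separate output.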

Course Content

  • Course Overview –> 2 lectures • 3min.
  • Data Integration –> 1 lecture • 2min.
  • Setup –> 6 lectures • 14min.
  • “Hello world” example –> 2 lectures • 3min.
  • Get to know the UI –> 5 lectures • 20min.
  • Your first job –> 2 lectures • 9min.
  • Process files –> 5 lectures • 32min.
  • Understand properties –> 6 lectures • 25min.
  • Process databases –> 7 lectures • 26min.
  • Process other formats –> 7 lectures • 38min.
  • Use variables –> 9 lectures • 30min.
  • Transformations –> 22 lectures • 1hr 46min.
  • Data quality –> 7 lectures • 16min.
  • File Management –> 7 lectures • 13min.
  • Job orchestration –> 12 lectures • 37min.
  • Logging –> 9 lectures • 21min.
  • Documentation –> 3 lectures • 13min.
  • Job Deployment –> 3 lectures • 6min.
  • Project Handling –> 4 lectures • 14min.
  • Use Cases –> 4 lectures • 6min.
  • Course conclusion –> 1 lecture • 1min.
  • Bonus section –> 4 lectures • 6min.

Requirements

Talend Open Studio is an open, flexible data integration solution. You build your processes in a graphical editor, and more than 600 components give you the flexibility you need.

Each section has a practical example, and you receive the complete material at the beginning of the course. That way you can not only follow each section, but also compare it with your own solution. Extensive practical scenarios are included as well, so you’ll be well equipped for practice!

What are the biggest topics you can expect?

  • Install on different operating systems (Windows, Linux, Mac)
  • Understand and use important data types
  • Read from and write to databases
  • Process different file formats, like Excel, XML, JSON, delimited, positional
  • Create and use metadata
  • Build schemas
  • Use helpful keyboard shortcuts
  • Retrieve data from web services
  • Connect to Google Drive and fetch data
  • Use iteration and loops
  • Convert data flows into iterations
  • Build and understand job hierarchies
  • Apply all major transformations: map, join, normalize, pivot, and aggregate data
  • Create and extract XML and JSON
  • Use regular expressions
  • Orchestrate components in processes
  • Check and improve data quality
  • Use fuzzy matching and interval matching
  • Use variables for different environments
  • Perform schema validation
  • Handle reject data separately
  • Find and fix errors quickly
  • Write meaningful logs
  • Include and react to warnings and aborts
  • Build job hierarchies and pass data between different levels
  • Implement and test your own assumptions
  • Configure your project for logging, versioning and context loading
  • Learn best practices and establish your own
  • Document items and have documentation generated
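
Several of these topics (regular expressions, schema validation, separate handling of reject data) follow one simple idea: check every row against the expected schema and send anything that does not fit to its own output instead of aborting the whole run. The following is a rough sketch of that idea in plain Python, not Talend code; the schema, the e-mail pattern, the sample rows and the `validate` function are all assumptions made up for this example.

```python
import re
from datetime import datetime

# Deliberately simple, hypothetical e-mail pattern; real-world rules are stricter.
EMAIL_PATTERN = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def parse_email(value):
    """Return the value unchanged if it looks like an e-mail address."""
    if not EMAIL_PATTERN.match(value):
        raise ValueError(f"not an e-mail address: {value!r}")
    return value


def parse_date(value):
    """Convert a YYYY-MM-DD string into a date; raises ValueError otherwise."""
    return datetime.strptime(value, "%Y-%m-%d").date()


# Hypothetical schema: column name -> converter that raises ValueError on bad input.
SCHEMA = {"id": int, "email": parse_email, "signup_date": parse_date}


def validate(rows, schema=SCHEMA):
    """Split rows into a valid flow (typed values) and a reject flow (with reason)."""
    valid, rejects = [], []
    for row in rows:
        try:
            valid.append({col: convert(row[col]) for col, convert in schema.items()})
        except (KeyError, ValueError) as exc:
            rejects.append({**row, "error": str(exc)})
    return valid, rejects


rows = [
    {"id": "1", "email": "ann@example.com", "signup_date": "2024-01-15"},
    {"id": "x2", "email": "bob@example.com", "signup_date": "2024-02-01"},  # bad id
    {"id": "3", "email": "no-at-sign", "signup_date": "2024-03-09"},        # bad e-mail
    {"id": "4", "email": "dan@example.com", "signup_date": "2024-13-01"},   # bad month
]
valid, rejects = validate(rows)
print(len(valid), "valid row(s)")
for reject in rejects:
    print("rejected:", reject["id"], "->", reject["error"])
```

Keeping the reason next to each rejected row is the part that pays off in practice: the reject output doubles as a meaningful log and makes errors quick to find and fix.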

What are you waiting for? See you in the course!
