Jump to content
Toggle sidebar
UNITApedia
Search
English
Personal tools
Create account
Log in
Pages for logged out editors
learn more
Talk
Contributions
Navigation
Main Page
User Guide
Documentation
Viewpoints
Structural
Strategic
Beneficiary
Semantic
Infrastructure
Data
Beneficiaries
UNITA Participants
GEMINAE
Collectives
Agile Management Guide
Quality Management Process
Tools
What links here
Related changes
Special pages
Page information
Page values
In other languages
Editing
User Guide
(section)
Page
Discussion
English
Read
Edit
Edit source
View history
More
Read
Edit
Edit source
View history
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
= ETL Pipeline Tutorials = This three-part tutorial series walks you through the complete '''ETL (Extract, Transform, Load)''' process used to feed structured data into the UNITApedia platform. Using [https://unitapedia.univ-unita.eu/strapi Strapi], [https://unitapedia.univ-unita.eu/hop Apache HOP], and [https://unitapedia.univ-unita.eu/pga PostgreSQL], we show how to create and manage data flows from form input to dynamic visualization. === Part I β Strapi: Data Collection Form === In this first part, we show how to use [https://unitapedia.univ-unita.eu/strapi Strapi] to create a structured form for data collection. You'll learn how to configure content types, add validation rules, and publish entries that will later be extracted and processed in our pipeline. <html> <div style="position: relative; overflow: hidden; padding-top: 56.25%;"> <iframe src="https://share.synthesia.io/embeds/videos/b32a5947-b6e1-4ba5-b030-5cb5eaa315fc" loading="lazy" title="Strapi Tutorial - ETL Pipeline Part I" allowfullscreen allow="encrypted-media; fullscreen;" style="position: absolute; width: 100%; height: 100%; top: 0; left: 0; border: none; padding: 0; margin: 0; overflow:hidden;"></iframe> </div> </html> === Part II β Apache HOP: Transformation Pipeline === This part focuses on [https://unitapedia.univ-unita.eu/hop Apache HOP], where we create a basic transformation pipeline to extract data from [https://unitapedia.univ-unita.eu/strapi Strapi], process and filter it, and then store it in a [https://unitapedia.univ-unita.eu/pga PostgreSQL] datamart. This transformation prepares the data for use in reporting and analysis tools. <html> <div style="position: relative; overflow: hidden; padding-top: 56.25%;"> <iframe src="https://share.synthesia.io/embeds/videos/7b430445-c85f-4c28-b1f0-a2c6318d305b" loading="lazy" title="Apache HOP Tutorial - ETL Pipeline Part II" allowfullscreen allow="encrypted-media; fullscreen;" style="position: absolute; width: 100%; height: 100%; top: 0; left: 0; border: none; padding: 0; margin: 0; overflow:hidden;"></iframe> </div> </html> === Part III β MediaWiki: Data Integration === In the final part, we use the External Data extension in MediaWiki to fetch and display the transformed data on our UNITApedia site. You'll learn how to configure database queries, create pages dynamically, and format the output for semantic use. The data displayed here originates from the [https://unitapedia.univ-unita.eu/pga PostgreSQL] datamart populated by [https://unitapedia.univ-unita.eu/hop Apache HOP] from [https://unitapedia.univ-unita.eu/strapi Strapi] entries. <html> <div style="position: relative; overflow: hidden; padding-top: 56.25%;"> <iframe src="https://share.synthesia.io/embeds/videos/99bed61a-fabc-4682-88ff-a29e687abfe4" loading="lazy" title="MediaWiki Integration - ETL Pipeline Part III" allowfullscreen allow="encrypted-media; fullscreen;" style="position: absolute; width: 100%; height: 100%; top: 0; left: 0; border: none; padding: 0; margin: 0; overflow:hidden;"></iframe> </div> </html>
Summary:
Please note that all contributions to UNITApedia are considered to be released under the Creative Commons Zero (public domain) (see
UNITApedia:Copyrights
for details). If you do not want your writing to be edited mercilessly and redistributed at will, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource.
Do not submit copyrighted work without permission!
Cancel
Editing help
(opens in new window)
Debug data: