Strategic Data Pipeline Design: Improving Operational Efficiency from Oracle to Single Storage Using Airflow S3 Data Pipeline
DOI:
https://doi.org/10.53469/jgebf.2025.07(05).02Keywords:
Data Integration, ETL, Apache Airflow, Oracle Database, Oracle Functions, Oracle External Directories, SingleStore Pipelines, SingleStore Procedure, Amazon S3 Storage, Data TransformationAbstract
This paper investigates an innovative ETL pipeline managed by Apache Airflow, integrating Oracle databases with SingleStore through Amazon S3. The architecture enhances efficiency, scalability, and reliability of data integration processes. By implementing a sequence of orchestrated tasks, the study demonstrates improvements in data throughput and process automation compared to traditional ETL techniques. This article is significant as it addresses the need for scalable and efficient ETL processes in enterprise data management, demonstrating the potential improvements over traditional methods.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Piyali Debnath

This work is licensed under a Creative Commons Attribution 4.0 International License.