Strategic Data Pipeline Design: Improving Operational Efficiency from Oracle to Single Storage Using Airflow S3 Data Pipeline

Authors

  • Piyali Debnath

DOI:

https://doi.org/10.53469/jgebf.2025.07(05).02

Keywords:

Data Integration, ETL, Apache Airflow, Oracle Database, Oracle Functions, Oracle External Directories, SingleStore Pipelines, SingleStore Procedure, Amazon S3 Storage, Data Transformation

Abstract

This paper investigates an ETL pipeline orchestrated by Apache Airflow that integrates Oracle databases with SingleStore through Amazon S3. The architecture is designed to improve the efficiency, scalability, and reliability of data integration. By implementing a sequence of orchestrated tasks, the study demonstrates improvements in data throughput and process automation compared to traditional ETL techniques. The work addresses the need for scalable and efficient ETL processes in enterprise data management.
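
As a rough illustration of the "sequence of orchestrated tasks" idea, the sketch below runs three dependent steps (extract, stage, load) in dependency order. It is not the paper's implementation and does not use Airflow, Oracle, S3, or SingleStore: the task names, the in-memory sample rows, and the `run_pipeline` helper are all hypothetical stand-ins, and Python's standard-library `graphlib` takes the place of a real scheduler.

```python
from graphlib import TopologicalSorter

# All task names and data below are hypothetical; the abstract does not
# list the actual tasks or schemas used in the paper.

def extract_from_oracle(ctx):
    # Stand-in for querying Oracle: two hard-coded sample rows.
    ctx["rows"] = [(1, "alpha"), (2, "beta")]

def stage_to_s3(ctx):
    # Stand-in for writing a CSV object to S3: build CSV text in memory.
    ctx["staged"] = "\n".join(f"{i},{name}" for i, name in ctx["rows"])

def load_into_singlestore(ctx):
    # Stand-in for a load step: parse the staged CSV back into tuples.
    ctx["loaded"] = [tuple(line.split(",")) for line in ctx["staged"].splitlines()]

TASKS = {
    "extract_from_oracle": extract_from_oracle,
    "stage_to_s3": stage_to_s3,
    "load_into_singlestore": load_into_singlestore,
}

# Each task maps to the set of tasks that must finish before it starts.
DEPENDENCIES = {
    "stage_to_s3": {"extract_from_oracle"},
    "load_into_singlestore": {"stage_to_s3"},
}

def run_pipeline():
    """Run every task once, upstream tasks first; return the order and shared state."""
    ctx = {}
    order = list(TopologicalSorter(DEPENDENCIES).static_order())
    for name in order:
        TASKS[name](ctx)
    return order, ctx
```

Calling `run_pipeline()` returns the execution order together with the shared context. In an orchestrator such as Airflow, the same dependency graph would be declared as a DAG, with scheduling, retries, and monitoring handled by the platform rather than by a loop like this one.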

Published

2025-05-29

How to Cite

Debnath, P. (2025). Strategic Data Pipeline Design: Improving Operational Efficiency from Oracle to Single Storage Using Airflow S3 Data Pipeline. Journal of Global Economy, Business and Finance, 7(5), 4–9. https://doi.org/10.53469/jgebf.2025.07(05).02

Section

Articles