Remote Data Engineer

MyJob

Full-time

Onsite

No experience limit

No degree limit

PA239-Stop / Military Museum, Santiago, Metropolitan Region, Chile

Description

**Job Summary:** A technology company is seeking a Data Engineer with experience in designing and implementing ETL pipelines to work on projects involving consolidation of multiple data sources.

**Key Highlights:**

1\. Design and implementation of ETL/ELT pipelines for source consolidation.
2\. Configuration and development of the platform on AWS.
3\. Documentation and handover of ETL flows.

A technology company is seeking a Data Engineer to work remotely on a project.

Conditions:

**1\. Compensation:** To be agreed upon.

**2\. Working Hours:** Monday to Friday, 44-hour workweek.

**3\. Work Location:** Remote.

**4\. Responsibilities:**

**Review of documentation and data sources**

- Analyze sample files from up to 6 data sources (brokers/vehicles, JSON assets, etc.) and their available metadata.
- Identify key fields, business keys, and normalization/anonymization requirements.

**Technical design of the ETL solution**

- Define the common data model for the 6 sources (schemas, data types, partitions, and naming conventions in S3).
- Design the ETL flow.

**Platform configuration on AWS**

- Create and/or adjust S3 buckets, folder structures, and basic permissions for the data lake.
- Configure Glue Catalog (tables and databases) and basic Glue resources for orchestration.

**Development of ETL pipelines for up to 6 sources**

- **Implement ingestion jobs:** file reading, field typing, error handling.
- **Implement normalization jobs:** column mapping to standard model, basic enrichments, generation of curated datasets ready for computation.
- Incorporate minimum data quality rules (mandatory fields, data types, value ranges) and logging of rejected records.

**Testing and fine-tuning**

- Execute tests using real/example data across the 6 sources, document incidents, and adjust transformations.
- Measure processing times and review partitioning structure to optimize subsequent queries.

**Documentation and handover**

- Document ETL flows (simple diagrams, job/table descriptions, S3 paths, source-specific rules).
- Conduct a handover session with the client’s team to explain how to operate and extend the pipelines.

**5\. Requirements:**

- **Minimum Experience:** 3 years as a Data Engineer working with ETL processes.
- **Experience in:** Design and implementation of ETL/ELT pipelines (ideally in projects consolidating multiple data sources).
- **AWS Data Handling:**
  - **Mandatory:** S3, IAM, data-oriented compute services (AWS Glue, AWS Lambda or similar).
  - **Desirable:** Athena and/or Redshift for data testing/validation.
- Use of SQL for queries and validations; Python desirable for transformation scripts.
- Working with data formats such as CSV, Excel, JSON.

**Minimum Education:** University / IP / CFT.

**Experience:** 3 years of experience.

**Keywords:** data, datos, engineer, engineers, ingeniero, ingeniera, ing, home, remote, remoto, remote work, trabajo desde casa

**Salary:** 0 CLP/MONTH.

Source: Indeed

Sofía Muñoz

MyJob · HR

Company

MyJob