Junior Data Engineer
- Level: junior
- Source: djinni.co
What you'll do
- Data Pipeline Design & Development: Design, implement, and maintain scalable data pipelines and ETL/ELT processes, primarily using Python and Spark (PySpark), to ingest, transform, and deliver data from various sources into analytics and ML platforms.
- Data Modelling & Warehousing: Design and optimize data models (e.g. star/snowflake schemas), build and manage data warehouses and data lakes, and ensure data structures support reporting, analytics, and ML use cases.
- Data Preparation for ML: Collaborate closely with data scientists and ML engineers to understand data requirements, implement robust preprocessing and feature engineering steps, and ensure datasets are clean, consistent, and suitable for machine learning models.
- Performance & Reliability: Optimize data processing jobs and SQL queries for performance and cost efficiency, monitor data pipelines in production, and ensure reliability, scalability, and adherence to SLAs.
- Governance, Quality & Security: Implement data quality checks, validation frameworks, and governance standards; ensure data security, privacy, and compliance in line with PwC and client requirements.


