Description:
Reporting to the Lead, Data Engineering, the Data Engineering Specialist is responsible for designing,
developing, and maintaining data integration and transformation processes in our cloud-based
data platform. While experience in Google Cloud Platform (GCP) is a significant asset,
candidates with proven expertise in other major cloud platforms (AWS, Azure) will also be
considered. This role emphasizes data governance, classification, and compliance—leveraging
tools such as Collibra to ensure high-quality, secure, and well-documented data assets.
Key Responsibilities
- Data Integration & Architecture
- Develop and orchestrate data pipelines that ingest data from various sources (e.g., MySQL, Oracle, PostgreSQL, flat files) into a cloud-based environment, and move data between multiple systems according to business needs and requirements.
- Collaborate with Data Analysts and Data Architects on defining data models, requirements, and architecture for optimal performance in databases (e.g. BigQuery or other cloud-based relational databases).
- Ensure robust ETL/ELT processes that support scalability, reliability, and efficient data access.
- Data Governance & Classification
- Implement and maintain data governance frameworks and standards, focusing on data classification, lineage, and documentation.
- Utilize Collibra or similar platforms to manage data catalogs, business glossaries, and data policies.
- Work closely with stakeholders to uphold best practices for data security, compliance, and privacy.
- Process Improvement & Automation
- Identify, design, and implement process enhancements for data delivery, ensuring scalability and cost-effectiveness.
- Automate manual tasks using scripting languages (e.g., Bash, Python) and enterprise scheduling/orchestration tools like Airflow.
- Conduct root cause analysis to troubleshoot data issues and implement solutions that enhance data reliability.
- Cross-Functional Collaboration
- Partner with cross-functional teams (IT, Analytics, Data Science, etc.) to gather data requirements and improve data-driven decision-making.
- Provide subject matter expertise on cloud data services, data classification standards, and governance tools.
- Monitor and communicate platform performance, proactively recommending optimizations to align with organizational goals.
Skills & Qualifications
Technical Expertise
- Experience with at least one major cloud platform (AWS, Azure, GCP), with GCP exposure considered a significant asset.
- Strong understanding of RDBMS (PostgreSQL, MySQL, Oracle, SQL Server) with the ability to optimize SQL queries and maintain database performance.
- Familiarity with version control systems (Git) to manage codebase changes and maintain a clean development workflow.
- Familiarity with data governance and classification concepts, leveraging Collibra or similar platforms to manage data lineage, business glossaries, and metadata.
- Knowledge of Linux/UNIX environments, and experience working with APIs (REST, SOAP) and their data formats (XML, JSON).