Summary
Overview
Work history
Education
Skills
References
Timeline
Generic
FAIZ ALAM

FAIZ ALAM

Dubai

Summary

Strategic Senior Manager in Data Engineering with expertise in Python, SQL, and Scala, delivering enterprise-scale cloud data solutions on GCP, AWS, and Azure. Proven leader in designing and implementing Data Lakes, Lakehouse architectures, and real-time streaming platforms using Apache Spark, Kafka, Airflow, and Kubernetes. Adept at driving data quality, governance, and compliance (GDPR/CCPA) while enabling digital transformation and e-commerce growth. Experienced in integrating emerging technologies, including generative AI, to accelerate analytics and business innovation.

Overview

14
14
years of professional experience
2010
2010
years of post-secondary education

Work history

Senior Manager/ Data Architect

Majid Al Futtaim
Dubai
03.2021 - 09.2025

Led the end-to-end architecture, design, and delivery of Majid Al Futtaim's enterprise-scale Carrefour Omni-Channel Data Lake on Big Data platforms (GCP & Azure), establishing a single source of truth for both in-store and digital commerce data. This strategic platform underpins the organization's reporting, advanced analytics, machine learning, forecasting, and operational decision-making capabilities.

  • Conceived and executed the data architecture blueprint from the ground up, leveraging Google Cloud (BigQuery, Dataflow, Pub/Sub) and Azure Synapse/Data Lake services, fully aligned with corporate vision and digital transformation objectives.
  • Defined and enforced enterprise data standards, governance frameworks, and quality controls, implementing Databricks Delta Lake and governance features to ensure accuracy, compliance, and trust in analytical outputs.
  • Directed cross-functional engineering teams to implement real-time streaming and batch ingestion pipelines using Databricks, Apache Spark, and GCP Dataflow, enabling unified, low-latency access to omnichannel datasets across multiple regions.
  • Partnered with C-level stakeholders to translate high-level business priorities into scalable technical solutions, optimizing performance and cloud cost efficiency across GCP and Azure environments.
  • Delivered a future-ready platform supporting multi-domain analytics, retail media activation, AI/ML integration, and predictive modeling, built on a foundation of Big Data technologies and Databricks collaborative workflows.

Solution Architect

Fast Retailing
Tokyo, TOKYO
09.2019 - 02.2021

Architected and delivered the RFID Store Inventory platform, establishing an external inventory database to provide real-time and batch visibility into store-level stock positions, improving accuracy and responsiveness across the global supply chain.

  • Designed and implemented end-to-end data pipelines leveraging Kafka (streaming), Airflow/Composer (orchestration), and BigQuery (analytics) to unify store, warehouse, and logistics data for high-quality, low-latency insights.
  • Enabled predictive analytics and operational dashboards on top of RFID data, giving business users enhanced visibility into supply chain movements, stockouts, and shrinkage patterns, significantly improving SCM efficiency.
  • Directed cross-functional collaboration across engineering and business stakeholders, from POC through production rollout, ensuring alignment with corporate digital transformation objectives.
  • Established operational readiness by transitioning the platform to the support team, creating monitoring processes, and handling change requests and enhancements to sustain scalability and reliability.

Engineer III (Senior Data Engineer)

Walmart Labs
03.2016 - 08.2019

Spearheaded the development of Walmart's global Merchant Datalake , an enterprise-wide single source of truth consolidating data across merchandising, pricing, and supply chain domains. This strategic platform powered highly responsive dashboards and self-service analytics, enabling business leaders to make proactive, data-driven decisions to optimize sales and operations.

  • Architected and delivered high-volume ingestion pipelines from Teradata, DB2, and Mainframe binary files into HDFS/Hive, supporting datasets with billions of records .
  • Designed and implemented complex, performance-optimized HQL transformations , reducing query execution times and improving analytical responsiveness for business users worldwide.
  • Integrated Datalake outputs with ThoughtSpot and other BI tools, enabling real-time insights for merchandising and category management teams.
  • Established data quality controls and governance processes , ensuring accuracy and trustworthiness of mission-critical analytics.
  • Led onboarding and technical training programs for new engineers, accelerating ramp-up time and fostering best practices in distributed data processing.
  • Delivered robust, scalable workflows using Datastage and Talend , ensuring reliable data delivery across heterogeneous environments.

Senior Software Engineer

Mindtree ltd.
07.2014 - 03.2016

Software Engineer

Ariba an SAP Company
04.2011 - 07.2014

Education

Bachelor of Engineering -

VTU

Skills

    Core Skills & Expertise

  • Programming & Data Engineering: Python, SQL, Scala, PySpark, Shell Scripting, dbt (Data Build Tool)
  • Cloud & Analytics Platforms: Google Cloud Platform (BigQuery, Pub/Sub, Composer), AWS (Redshift, Glue, Lambda), Azure Synapse Analytics, Databricks, Snowflake
  • Data Architecture & Warehousing: Data Lakes, Lakehouse Architectures, Real-Time Streaming, Batch ETL, Data Mesh, Data Vault 20
  • Big Data & Distributed Systems: Apache Spark, Apache Kafka (Confluent & Managed), Apache Flink, Hive, Druid, Greenplum, Netezza, Teradata
  • Orchestration & Workflow Automation: Apache Airflow, Kubernetes, Cloud Composer, CI/CD (GitLab, GitHub Actions)
  • Data Quality & Governance: Data Observability, Great Expectations, Monte Carlo, GDPR/CCPA Compliance, Master Data Management (MDM)
  • Business & Domain Expertise: E-Commerce & Omni-Channel Retail Strategy, Retail Media Data Integration, AdTech & Marketing Analytics, Digital Transformation
  • Emerging Technologies: Generative AI for Analytics (Vertex AI, Bedrock), ML Pipeline Orchestration (Kubeflow, Vertex AI Pipelines), Real-Time Personalization Engines

References

References available upon request

Timeline

Senior Manager/ Data Architect

Majid Al Futtaim
03.2021 - 09.2025

Solution Architect

Fast Retailing
09.2019 - 02.2021

Engineer III (Senior Data Engineer)

Walmart Labs
03.2016 - 08.2019

Senior Software Engineer

Mindtree ltd.
07.2014 - 03.2016

Software Engineer

Ariba an SAP Company
04.2011 - 07.2014

Bachelor of Engineering -

VTU
FAIZ ALAM