
Fact Sheet
Location: India (Remote / Hybrid)
Employment type: Full-Time
Experience: 4–5+ years
About Boston Insights
Boston Insights is revolutionizing the clinical supply chain with BI-AXIOM, an AI-powered, real-time command center that unifies data, decisions, and risk management across the clinical supply lifecycle. As we scale our modular, SaaS-based platform, we are seeking a Lead Data Engineer to build its next-generation data foundation.
About AXIOM
AXIOM is our flagship Clinical Supply Chain Command Center — a SaaS platform that provides pharmaceutical organizations with end-to-end visibility, intelligence, and control across their clinical trial supply networks. AXIOM integrates data from planning, manufacturing, logistics, and RTSM systems into a unified platform, enabling proactive risk sensing, predictive analytics, and real-time decision-making.
Role Overview
We are looking for a Lead Data Engineer to own the design, development, and optimization of AXIOM's data engineering layer within Microsoft Fabric. This is a hands-on technical leadership role where you will architect scalable data pipelines, build Lakehouse and Warehouse structures, and drive data quality and governance across our multi-tenant SaaS platform. You will work closely with the product and architecture teams to deliver analytics-ready data that powers real-time clinical supply chain intelligence for global pharmaceutical companies.
Key Responsibilities
• Design and build end-to-end data ingestion and transformation pipelines using Microsoft Fabric Data Factory, Dataflows Gen2, Spark, and PySpark notebooks
• Architect and maintain Lakehouse and Warehouse structures within Microsoft Fabric, implementing medallion architecture (Bronze, Silver, Gold layers)
• Manage OneLake file structures, Delta tables, partitioning strategies, and storage governance for a multi-tenant SaaS environment
• Develop incremental loading frameworks, data quality validation, and error handling mechanisms
• Optimize query performance, schema design, and data models to support Power BI semantic models and real-time analytics
• Implement CI/CD pipelines for data engineering artifacts using Azure DevOps and Git integration
• Collaborate with Power BI developers, Power Apps developers, and data analysts to ensure seamless data delivery for reporting and application layers
• Support multi-tenant workspace isolation (workspace-per-customer model on shared Fabric capacity) and enforce tenant-level data security
• Build and maintain integrations with external systems including ERP, MES, WMS, CTMS, RTSM/IRT, and 3PL platforms
• Produce and maintain comprehensive technical documentation covering data models, pipeline logic, workspace structure, and governance policies
• Mentor junior data engineers, establish coding standards, and conduct code reviews
• Stay current with Microsoft Fabric updates, new features, and best practices to continuously improve the platform
Required Qualifications
• Bachelor's or Master's degree in Computer Science, Data Engineering, Information Technology, or a related field
• 4+ years of hands-on experience in data engineering, including at least 1 year working with Microsoft Fabric
• Strong proficiency in SQL, PySpark, Python, and Delta Lake
• Deep experience with Microsoft Fabric components: Lakehouse, Warehouse, Data Factory, Dataflows Gen2, Notebooks, and OneLake
• Solid understanding of ETL/ELT patterns, data modeling (star schema, snowflake schema), and data warehousing concepts
• Experience with Azure services: Azure Data Lake, Azure Key Vault, Azure DevOps, Microsoft Entra ID
• Proficiency in Power BI integration, including semantic models (formerly datasets) and Direct Lake mode
• Experience with CI/CD workflows for data engineering (Git integration, deployment pipelines)
• Strong understanding of data governance, security, and compliance in multi-tenant environments
• Excellent written and verbal communication skills; ability to produce clear technical documentation
Preferred Qualifications
• Microsoft Certified: Fabric Data Engineer Associate (exam DP-700)
• Experience in pharmaceutical, life sciences, or healthcare supply chain domains
• Familiarity with clinical trial data systems (CTMS, RTSM/IRT, CDISC standards)
• Experience with Power Apps, Power Automate, or Power Pages development
• Exposure to AI/ML integration within Microsoft Fabric (Synapse Data Science, notebooks)
• Experience architecting multi-tenant SaaS data platforms
• Knowledge of Kusto Query Language (KQL) and Real-Time Intelligence (formerly Real-Time Analytics) in Fabric
What We Offer
• Opportunity to build a category-defining product in clinical supply chain technology from the ground up
• Direct impact on global pharmaceutical supply chains and patient outcomes
• Work with cutting-edge Microsoft Fabric and Azure technologies
• Collaborative, startup-paced environment with significant ownership and growth potential
• Competitive compensation and benefits
• Remote-first work culture with flexibility
