Data Architect


Company Description

Grupdev LLC, headquartered in New Jersey, USA, is a cloud-first Amazon Web Services (AWS) partner specializing in innovative technology solutions to address real-world challenges. Our multidisciplinary team is dedicated to delivering exceptional customer experiences and building strong, collaborative partnerships. We empower diverse industries, including Finance, Healthcare, and Manufacturing, with cutting-edge solutions in Cloud, Big Data, AI/ML, DevOps, and advanced analytics. By leveraging modern practices in cloud environments, data analytics, and intelligent automation, Grupdev helps clients navigate every stage of digital transformation, ensuring tangible results and growth in the digital age.


Role Description

This is a full-time, remote role for a Data Architect. The Data Architect will be responsible for designing and implementing data architecture frameworks, developing comprehensive data models, and ensuring proper data governance. They will lead Extract, Transform, and Load (ETL) processes, build and optimize data warehouses, and collaborate with cross-functional teams to understand business requirements. Regular tasks include designing data-driven solutions, ensuring data quality and reliability, and staying updated on data architecture best practices and trends.



Years of Expirence : 5 to 7 years


  • Design scalable data architectures and pipelines for Databricks, Delta Lake, and Azure Data Lake Storage (ADLS).
  • Define data ingestion and consumption patterns based on the needs of the use case and the broader needs of the organization.
  • Collaborate with stakeholders to translate business requirements into technical solutions, ensuring data quality and consistency.
  • Work with the business and data teams to define and document data quality and validation rules.
  • Define end to end data flows and integration points aligned to Databricks medallion architecture.
  • Provide recommendations on optimize performance of ETL/ELT pipelines, jobs, and queries for large-scale datasets.
  • Ensure integration of Databricks with other Azure services such as Azure Synapse Analytics, Azure SQL, Azure Event Hubs, and Power BI.
  • Establish governance standards for data usage across the organization.