Senior Data Architect/Engineer - Remote

  • Innovit USA Inc
  • Remote
  • $65,000 - $90,000 yearly

Job Description

Title: Senior Data Architect/Engineer

Experience Level: Senior/Architect Level

Location: 100% Remote

Duration: 3-5 Year Contract

Responsibilities:

  • Create and maintain optimal universal identity and MDM architecture.
  • Assemble, manage, and ensure the quality of the commonwealth’s inventory of unique identities, including both individuals and organizations.
  • Manage record de-duplication workflows and process matching cases that require manual resolution.
  • Manage data enrichment functions of supported solutions to improve matching capabilities and enhance the value of existing data.
  • Manage and maintain connections with RESTful and other third-party APIs.
  • Provide technical assistance to the PALDS team with assembling large, complex data sets that meet functional / non-functional business area requirements.
  • Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
  • Work with stakeholders including the Executive, Data, Design, and Support teams to assist with data-related technical issues and support their data infrastructure needs.
  • Keep data separated and secure across Agency boundaries such as data centers and cloud regions.
  • Create data tools for analytics and data scientist team members that assist them in building and optimizing data products needed to support ongoing operations and data-driven decision making.
  • Work with data and analytics professionals to strive for greater functionality in our data systems.

Qualifications:

  • Experience implementing and maintaining large-scale universal identity solutions, covering millions of unique identities and linking records across multiple source systems.
  • Experience implementing and maintaining MDM solutions as part of an enterprise-scale centralized data hub.
  • Experience managing and maintaining connections with third-party RESTful APIs.
  • Advanced working knowledge of SQL, including query authoring, and experience with relational databases, as well as working familiarity with a variety of other databases.
  • Experience building and optimizing ‘big data’ data pipelines, architectures, and data sets.
  • Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement.
  • Strong analytic skills related to working with unstructured datasets.
  • Experience building processes supporting data transformation, data structures, metadata, dependency, and workload management.
  • A successful history of manipulating, processing, and extracting value from large, disconnected datasets.
  • Working knowledge of message queuing, stream processing, and highly scalable ‘big data’ data stores.
  • Strong project management and organizational skills.
  • Experience supporting and working with cross-functional teams in a dynamic environment.
  • We are looking for a candidate with 5-7+ years of experience in a Senior Data Architect/Engineer role, who has attained a degree in Computer Science, Statistics, Informatics, Information Systems, or another quantitative field. They should also have experience using the following software/tools:
    • Experience with universal identity and MDM tools: Verato, Tamr, Informatica, Talend, etc.
    • Experience with big data tools: Hadoop, Spark, Kafka, etc.
    • Experience with relational SQL and NoSQL databases, including Oracle, MS SQL Server, Postgres, Cassandra, etc.
    • Experience with data pipeline and workflow management tools: Azkaban, Luigi, Airflow, etc.
    • Experience with data integration services solutions from vendors such as Informatica, MuleSoft, Talend, TIBCO, etc.
    • Experience with cloud-based data services such as AWS (EC2, Glue, EMR, RDS, Redshift, etc.) and/or Azure (Azure SQL, Data Factory, Synapse, Databricks, etc.)
    • Experience with stream-processing systems: Kinesis, Storm, Spark Streaming, etc.
    • Experience with object-oriented/object function scripting languages: Python, R, Java, C++, Scala, etc.
