Sr. Data Architect/Data Modeler
The Sr. Data Architect/Data Modeler will be part of the team to develop the new data platform that will enable integration of data sources with the next generation technologies across functional boundaries and enable researchers and decision makers to solve ever more complex problems and make decisions that impact patient outcomes.
Responsibilities:
Work with multiple teams to identify, design and build appropriate dataset and linkages for complex data.
Support refactoring legacy systems into microservices which integrate with a Hadoop data lake.
Contribute to evaluating and selecting new tools for data management and promoting industry best data management practices among the development teams.
Manage metadata for all data sources within a Hadoop data lake.
Organize, deliver, and ensure data integration support.
Develops/manages complex data models (conceptual, logical, & physical) in multiple formats (relational, star/snowflake, object-oriented, etc )
Analyzes & acquires data from primary and secondary data sources – creating mapping specifications/requirements for use by ETL development resources
Maintains knowledge on current and emerging developments/trends for assigned areas of responsibility, assesses the impact, and collaborates with management to incorporate new trends and developments in current and future solutions.
Identifies and recommends process improvements that significantly reduce workloads or improve quality for his/her assigned areas of responsibility.
Determines how existing applications, systems, databases, interfaces and/or hardware can interact to meet new and emerging enterprise initiatives.
Provides input and validates project plans, test plans and implementation plans.
Consults and/or participates in the requirements, design and coding walkthroughs to ensure the development of quality solutions.
Proactively identifies problems and presents/develops solutions
Communicates effectively with internal stakeholders and management.
Required Skills:
5+ Years of experience implementing relational database designs, data warehousing, data architecture, & data modeling, including strong knowledge of various data modeling approaches and practices (experience with 3NF, star schemas, and multi-dimensional designs)
6+ years SOL experience
5+ years creating physical data models for relational databases.
4+ years of experience working in Hadoop environment, including HDFS, Hive, Sqoop, HBase, Pig, Flume, Parquet, Avro and/or Spark.
3+ years of Experience with XML, UML and JSON.
Knowledge of Informatics, analytics, computational science and service management.
Experience with ERwin Enterprise Data Modeler or equivalent modeling software.
Strong written and verbal communication skills.
Preferred Experience:
Ability to logically model healthcare data.
Experience working with Agile methodologies