Job Information
Stanford University Data Engineer in Stanford, California
Data Engineer
Business Affairs: University IT (UIT), Redwood City, California, United States
Information Technology Services
Post Date Feb 06, 2024
Requisition # 102181
Job Purpose:
Responsible for performing difficult or complex analysis, design and programming involving multi-project leadership and broad responsibility in support of new and existing Data Engineering Projects and production support activities.
Core Duties:
Collaborate with cross-functional teams to understand business needs and identify opportunities for applying machine learning solutions.
Apply experience with Natural Language Processing (NLP) and with deploying and using Large Language Models (LLMs).
Deep understanding of and commitment to software engineering principles and processes (e.g. Lean, Agile, DevOps, CI/CD) and continuous improvement through measurement.
Apply thorough knowledge of, and practical expertise in, Data Management Frameworks to design world-class data stores. Best practices, data quality and security are critical.
Understand data endpoints and consumers, and develop a data strategy.
Maintain a fluid end-to-end data vision and design pipelines for seamless data flow.
Lead and perform the design, development, implementation and maintenance of complex Data Store, Data Lake, Lakehouse and Data Warehousing systems and data-intensive solutions that are scalable, optimized, and fault-tolerant.
Design and implement Data Migration and Data Integration across cloud and hybrid environments.
Mastery and hands-on experience with Data Engineering technologies and scripting languages. Identify new technologies and provide recommendations to Management.
Solid understanding of and experience in Cloud technologies and applications, including Data Migration, Integration, API development, Data Streaming (batch and continuous) and scheduling.
Data Modeling skills. Able to come up with a Canonical Data Model and simplify data flow and interaction between different applications. Should be able to integrate new sources smoothly.
Ability to translate complex functional and technical requirements into detailed architecture, design and high performing software.
Design, build and optimize pipelines for data collection for storage, access and analytics.
Out-of-the-box thinking to overcome engineering challenges with innovative design principles.
Minimum Requirement
Education and Experience:
- Bachelor's degree and five years of relevant experience or a combination of education and relevant experience.
Knowledge, Skills and Expertise:
Thorough understanding of and experience in Data Lake, Lakehouse and Data Warehousing architecture. Should be able to suggest, architect and implement a Data Lake, Lakehouse or Data Warehouse solution with a set of available cloud tools and programming.
Knowledge of and ability to use AWS SageMaker Studio and Bedrock, and to design the strategy for and implement an MLOps architecture. Should be well versed in AI/ML programming, using various Large Language Models (LLMs) to build customized solutions.
Hands-on experience and expertise in advanced SQL, advanced Python programming, AWS services (such as S3, Redshift, Glue ETL, IAM, DMS, AppFlow, VPC and others), Snowflake, Fivetran, Kafka, Airflow, Oracle Cloud and other open-source tools.
Experience in writing reusable, complex Python/PySpark scripts for ELT, business logic or APIs. Other coding experience, such as Scala and R programming, is a plus.
Hands-on development work on all aspects of data analysis, data provisioning, modeling, performance tuning and optimization.
Experience working in the AWS cloud environment, selecting the right tools from the marketplace and using them efficiently to meet business requirements.
Mastery of relational, NoSQL or NewSQL database systems. Expertise in working with unstructured, structured and semi-structured data.
Build scalable data pipelines for both real-time and batch workloads using best practices in data modeling and ETL/ELT processing across various technology stacks.
Experience in designing and implementing tight data security at various levels.
Experience in streaming data from SaaS/PaaS applications. Experience with DataOps and the related set of practices, processes and technologies.
Experienced in Data Migration and Data Integration. Knows the pain points in data integration across SaaS applications and can implement the solution that best fits the organization.
Constantly monitor operations, tune for better performance and utilization.
PHYSICAL REQUIREMENTS*:
Constantly perform desk-based computer tasks.
Frequently sit, grasp lightly/fine manipulation.
Occasionally stand/walk, use a telephone.
Rarely write by hand, lift/carry/push/pull objects that weigh up to 10 pounds.
- Consistent with its obligations under the law, the University will provide reasonable accommodation to any employee with a disability who requires accommodation to perform the essential functions of the job.
- May work extended hours, evenings and weekends.
Interpersonal Skills: Demonstrates the ability to work well with Stanford colleagues and clients and with external organizations.
Promote Culture of Safety: Demonstrates commitment to personal responsibility and value for safety; communicates safety concerns; uses and promotes safe behaviors based on training and lessons learned.
Subject to and expected to comply with all applicable University policies and procedures, including but not limited to the personnel policies and other policies found in the University's Administrative Guide, http://adminguide.stanford.edu.
Freedom to grow. We offer career development programs, tuition reimbursement, or the option to audit a course. Join a TED Talk or film screening, or listen to a renowned author or global leader speak.
A caring culture. We provide superb retirement plans, generous time-off, and family care resources.
A healthier you. Climb our rock wall or choose from hundreds of health or fitness classes at our world-class exercise facilities. We also provide excellent health care benefits.
Discovery and fun. Stroll through historic sculptures, trails, and museums.
Enviable resources. Enjoy free commuter programs, ridesharing incentives, discounts and more!
Redwood City. Our new Stanford Redwood City campus, opened in 2019, will be the workplace for approximately 2,700 staff, including University IT, whose jobs are important to supporting the University’s mission. The campus will offer amenities such as onsite cafes and a dining pavilion, a high-end fitness facility with an outdoor pool, and a childcare center for Stanford families.
Schedule: Full-time
Job Code: 4734
Employee Status: Regular
Grade: K
Requisition ID: 102181
Work Arrangement: Hybrid Eligible