Pearson Principal Data Engineer in Durham, North Carolina
At Pearson, our purpose is simple: to add life to a lifetime of learning. We believe that every learning opportunity is a chance for a personal breakthrough. That’s why our 20,000 Pearson employees are committed to creating vibrant and enriching learning experiences designed for real-life impact. We are the world’s leading learning company, serving customers in nearly 200 countries with digital content, assessments, qualifications, and data. For us, learning isn’t just what we do. It’s who we are. Visit us at pearsonplc.com
As a data engineering lead and solutions architect, individual will have to forefront and drive robust data engineering process to complement next gen data analytics using agile methodologies in collaboration with the group of business analysts, data, devops, test and release engineers to plan, estimate, architect the low level technical solution and execute the project along with leading all the technical team on the project to ensure reliable, secure, and cost-effective distributed solutions.
You must demonstrate a strong competency in helping to create and sustain an enterprise-wide environment that fosters accountability, quality, commitment, growth , innovation and adhere the best industrial standards . Individual should work with geographically distributed teams on technical challenges and process improvements.
Must Have Skills
10+ years of practical experience designing and building data solutions .
should have more than 6 years of cloud experience with at least 3 years on GCP architecting and building data lakes /data warehouses / data pipelines / AI & ML solutions using GCP services
Enterprise experience with GCP services like storage&database , data processing and secondary services using including BigQuery, Cloud SQL. , Bigtable, , Pub/Sub, Cloud Composer, Dataflow, Dataproc, Dataprep, Data Studio, Bigtable, Cloud Storage, file store, Cloud VM, Composer, Appengine, GKE or similar cloud experience.
Understand different types of storage (filesystem, relational, NoSQL) and working with various kinds of data (structured, unstructured, metrics, log files, etc.)
Experience in building scalable and reusable data pipelines (ETL, ELT) using airflow and data wrangling procedures using Python and SQL.
Responsible for design setting up of new GCP cloud environments, Sizing and managing deployments
Tune application and query performance using performance profiling tools and SQL
Experience with batch and stream processing (including GCP Dataflow/Kafka Streams/Spark)
Working knowledge of data visualization tools such as Looker and Tableau is a plus.
Experience working with agile software development practices and drive to ship quickly.
Research, analyze, and recommend technical approaches for solving difficult and challenging development and integration problems.
Responsible for maintenance and enhancement of data platform which involves adding various operators for carrying out tasks in Apache Airflow
Accountable that the team adheres to provided estimates and technical design, code review appropriate to the best performance standards
Identify, design and implement internal process improvements by automating manual processes and optimizing data delivery
Experience with Continuous Integration and Automated Test tools such as Jenkins, Artifactory, Git
Experience leading a team of engineers
Bachelor's degree from four-year College or university in Computer Science or related field
Technical certifications such as Google Cloud Data Engineer or advanced certifications in data science a plus.
Experience with microservice patterns, API development, RESTful web services.
Experience with containerization technologies (Docker, Kubernetes)
familiarity or experience with data mining and statistical modeling (e.g., regression modeling, clustering techniques, decision trees, etc.)
Experience with data science models using AI/ML
Exposure to Network services like Cloud CDN / DNS / IDS / NAT / Interconnect / VPN / VPC etc
Learning is the most powerful force for change in the world. More than 20,000 Pearson employees deliver our products and services in nearly 200 countries, all working towards a common purpose – to help everyone achieve their potential through learning. We do that by providing high quality, digital content and learning experiences, as well as assessments and qualifications that help people build their skills and grow with the world around them. We are the world’s leading learning company. Learn more at pearsonplc.com.
Pearson believes that wherever learning flourishes, so do people. We are committed to being an anti-racist company in everything we do. We value the power of an inclusive culture and a strong sense of belonging. We promote a culture where differences are embraced, opportunities are accessible, consideration and respect are the norm, and all individuals are supported in reaching their full potential. Through our talent, we believe that diversity, equity, and inclusion make us a more innovative and vibrant place to work. People are at the center, and we are committed to a sustainable environment and workplace where talent can learn, grow, and thrive.
To learn more about Pearson’s commitment to a diverse and inclusive workforce, please click here: http://www.pearson.com/careers/diversity-and-inclusion.html
Pearson is an Affirmative Action and Equal Opportunity Employer and a member of E-Verify. We are committed to building a team that represents a variety of backgrounds, perspectives, and skills. The more inclusive we are, the better our work will be. All employment is decided based on qualifications, merit, and business need. All qualified applicants will receive consideration for employment without regard to race, ethnicity, color, religion, sex, sexual orientation, gender identity, gender expression, age, national origin, protected veteran status, disability status, or any other group protected by law.
Organization: Corporate Strategy & Technology
Req ID: 7838
- Pearson Jobs