Data Engineer

Apply now »

Date: Dec 7, 2023

Location: Pune, IN

Company: Springer Nature Group

Springer Nature opens the doors to discovery for researchers, educators, clinicians and other professionals. Every day, around the globe, our imprints, books, journals, platforms and technology solutions reach millions of people. For over 175 years our brands and imprints have been a trusted source of knowledge to these communities and today, more than ever, we see it as our responsibility to ensure that fundamental knowledge can be found, verified, understood and used by our communities – enabling them to improve outcomes, make progress, and benefit the generations that follow. 

Visit: group.springernature.com and follow @SpringerNature

 

Job Title:     Data Engineer

Location(s):   Pune

 

About Springer Nature Group

Springer Nature opens the doors to discovery for researchers, educators, clinicians and other professionals. Every day, around the globe, our imprints, books, journals, platforms and technology solutions reach millions of people. For over 180 years our brands and imprints have been a trusted source of knowledge to these communities and today, more than ever, we see it as our responsibility to ensure that fundamental knowledge can be found, verified, understood and used by our communities – enabling them to improve outcomes, make progress, and benefit the generations that follow.

 

About Springer Nature

 

Springer Nature Technology and Publishing Solutions, is the technology and publishing solutions arm of the Springer Nature Group. We leverage our insight in the publishing domain and acquire, produce and deliver content across media and markets using our Technology and Publishing Solutions. With a focus on technology driven solutions and deep insight in the publishing domain, Springer Nature Technology and Publishing Solutions offers a range of services that help our Group brand acquire, produce and deliver content in the most efficient ways possible. We are driven by over 1000 professionals in Technology, Research & Analysis and Marketing shared services.

 

We are proud to be an equal-opportunity employer. All applicants will be considered for employment on the basis of merit alone, without attention to age, race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status

 

About the Role:

 

Springer Nature seeks a Data Engineer for its highly-regarded Nature Research Intelligence, group. The group meets the needs of Springer Nature’s Research division which includes Nature, Springer, BioMedCentral and Scientific American, as well as developing new data products for the research community. This is an exciting opportunity as Data Engineering is expanding from strong foundations into new solutions, and we are looking for someone who can deliver solutions and work independently, with support from the wider team where necessary.

As a Data Engineer, you will be responsible for ensuring a continuous flow of data with minimum latency between data sources, you will be developing, testing and deploying data pipelines into the production environment. You will be a part of the team which delivers ML/AI solutions at scale.

You will be working in close partnership with data analysts, data scientists, and data engineers, as well as other colleagues from Springer Nature and our technology partners, including Google. You will have opportunities to work with the latest data and analytics technologies, including graph databases, Google BigQuery, Tensorflow, and Plotly Dash, and preview new technologies from Google and other partners.

 

Role Responsibilities:

 

  • Build streaming/batch Data pipelines for extraction/loading/transforming data between various data sources at scale in different formats.
  • Work closely with Data Scientists /Analysts/Product Managers to understand the requirements and develop data solutions in line with the business requirements.
  • Explore various best practices to Deliver/Deploy/Maintain the in-house ML/AI solutions at scale.
  • Automate/orchestrate various template solutions to ensure continuous delivery.
  • Maintain the current cloud infrastructure and help onboard the new applications.
  • Use creative ideas to ensure ease of Data Use within the organization.

 

Experience, Skills & Qualifications:

Experience: 2-4 years

Qualification: University degree with a strong analytical/quantitative background or equivalent experience (e.g. Data Science, Statistics, Mathematics, Econometrics, Physics, Computer Science etc.

 

Essential

 

  • SQL and Python
  • problem-solving capabilities
  • at least one of the distributive frameworks such as Apache Beam or Spark
  • Well organized and accurate with good time management

 

Desirable

 

  • Machine Learning concepts are beneficial but not essential as training will be provided
  • schema designing data modeling
  • Google Cloud products (BigQuery, Dataform, Colab) or other cloud data platforms beneficial

 

#LI-DP1

At Springer Nature, we value the diversity of our teams and work to build an inclusive culture, where people are treated fairly and can bring their differences to work and thrive. We empower our colleagues and value their diverse perspectives as we strive to attract, nurture and develop the very best talent.

Springer Nature was awarded Diversity Team of the Year at the 2022 British Diversity Awards. Find out more about our DEI work here.

 

If you have any access needs related to disability, neurodivergence or a chronic condition, please contact us so we can make all necessary accommodation.

Apply now »