Machine Learning Pipeline / Big Data Engineer

Job Description / Skills Required

Requisition ID: R15721
Are you passionate about solving interesting and challenging Big Data / Machine Learning (ML) / Deep Learning (DL) pipeline problems? Do you have expertise and interest in solving interesting business problems using large-scale analytics and algorithms? If yes, you could be a great fit for this role with the Data Science team at Groupon. The Data Science team performs complex analysis over large data sets and develop ML-based data science solutions using cutting edge technology At any given time, the team works on a portfolio of projects that have a direct impact on Groupon’s business.

Responsibilities:

Provide thought leadership and execute on challenging data engineering problems that aim to make Groupon's marketplace more efficient.

Design, develop and automate robust data pipelines that consolidate data from various batch or streaming sources.

Work with other engineering partners, develop and maintain ML and DL pipelines and integrate them into the product base.

Optimize ML/DL pipelines on current and next generation hardware including GPUs.

Convert data into well-thought features that powers data science algos.

Requirements:

CS degree with 4+ years of experience as a SWE/Data Engineer

Excellent understanding of common families of models, feature engineering, feature selection, and other practical machine learning issues

2+ years of industry experience with designing/developing/productionalizing end-to-end ML/DL pipelines

Experience with Spark (SparkML, Spark Streaming, SparkSQL, Py-Spark, SparkR, etc.)

Experience with distributed Big Data technologies (Hadoop, HBase, Cassandra, Kafka, Storm, Elastic Search)

Experience with open source data pipeline technologies (Oozie, Airflow, Luigi, Azkaban, etc.)

Experience with data visualization, developing Analytics dashboard/deep dive solutions (Kibana, Tableau, etc.)

Experience with ML frameworks like TensorFlow, Caffe, Theano, Torch.

Experience in Python, R. Scala/Java experience is preferred

Knowledge of ML-as-a-service, Lamba Architecture for ML, developing frameworks for detection of model decay is a big plus

Groupon provides a global marketplace where people can buy just about anything, anywhere, anytime. We’re enabling real-time commerce across an expanding range of categories including local businesses, travel destinations, consumer products, and live or lively events. At the same time, we are providing advertising options and tools that merchants can use to grow and manage their businesses. Culturally, we believe that great people make great companies and that starting with the customer and working backward moves us forward. Community matters to us on an internal, local and global scale—it’s fundamental to our company’s growth and to the well-being of the world at large. We also value self-awareness, candor, lunch and WiFi. If we match with you, please apply to join us.