Data Analytics Engineer at AsperaLabs

IBM Emeryville, CA

AsperaLabs is a team of scientists and engineers within Aspera dedicated to understanding and developing solutions for the basic problems associated with large data, including access, analysis, visualization and transport. The team collaborates with academia, Aspera technology partners, and customers to carry out basic research that advances the state of the art and benefits Aspera's operational capabilities and product excellence.

The Big Data team of AsperaLabs is focused on solving fundamental problems around the analysis and large-scale visualization of large data sets. At Aspera the problem space ranges from analysis of data associated with transfer performance and network utilization, to product and customer usage patterns and troubleshooting patterns, to large-scale market analysis, to time-series machine learning and modeling. We are interested in collaborations with fellow researchers and partners to develop advanced algorithms, approaches, methodologies and systems to capture, analyze and visualize this information.

Aspera currently has an opportunity for a Data Analytics Engineer in our Emeryville, CA office. This role will focus on big data architectural problems in the Hadoop ecosystem, creating new architectures for logging and metadata processing to support real-time analysis and visualization.

Responsibilities

Create new data processing pipelines using statistical and machine learning techniques and applying open technologies (Apache HDFS, Spark/MapReduce, Redis, etc.);
Develop machine learning models for time-series analysis and performance prediction;
Perform data analysis, feature extraction and modeling of time-series data, focused on signal processing;
Assist in the research and development of the next generation of performance automation test suites.

Required Technical and Professional Expertise

Bachelor's degree or higher in computer science, engineering, mathematics or a related technical discipline;
Minimum of 3 years' experience in software engineering and knowledge of the software development life cycle. This includes applying principles, techniques, procedures, and equipment to the design and production of goods and services;
Strong programming background in C/C++, Spark, Ruby, Python and Go, and a strong foundation in regular expressions;
Demonstrated experience in one of the following database technologies: SQL and relational databases (MySQL, Oracle or SQL Server), including efficient database design, or NoSQL (HBase, Redis, MongoDB or Riak);
Ability to identify complex problems and review related information to develop and evaluate options and implement solutions;
Ability to innovate mathematical methods or formulas to solve problems;
Ability to collaborate with team members and work independently;
Excellent communication skills (both verbal and written).

Preferred Technical and Professional Experience

Graduate degree (MSc or PhD) in computer science, engineering or a related technical discipline;
Experience with web application programming and distributed systems;
Demonstrated excellence in an R&D or academic environment.