Data Scientist - Modeling | Twitter, Inc. | San Francisco, CA
Data Scientist - Modeling
Software Engineering | San Francisco, CA
As a Twitter Data Scientist specializing in modeling,
you will be designing, building, and shipping complex statistical models
that learn from Twitter data. We are looking for folks that are
passionate about understanding data, are well versed in scalable data
mining and machine learning techniques, and love to build models. If
statistical challenges like the Netflix prize and KDD cup
excite you, this is your dream job. A passion for measuring model
quality and iteratively improving them using feature engineering is a
big plus.
Responsibilities
Requirements
Responsibilities
-
Build complex statistical models that learn from and scale to petabytes of data.
-
Use Map-Reduce frameworks such as Pig and Scalding, statistical
software such as R, and scripting languages like Python and Ruby.
-
Write and interpret complex SQL queries for standard as well as ad hoc data mining purposes.
-
Define metrics, understand A/B testing and statistical measurement of model quality.
-
Understand and leverage crowdsourcing and human computation approaches to data labeling
Requirements
-
MS or PhD in Data mining, Machine learning, Statistics, Math,
Engineering, Operations Research, Computer Science, or other
quantitative discipline.
-
Fluent in one or more object oriented languages like Java, C#,
C++, Scala, etc (or equivalent) and scripting languages like Python or
Ruby (or equivalent)
-
Experience with feature engineering and model building
-
Experience with statistical programming environments like R or Matlab.
-
Experience with scripting languages like Python or Ruby etc.
-
Experience in mapping business needs to engineering systems.
-
Plus: Three or more years of industry experience is a plus.
-
Plus: Experience with large datasets and map-reduce
architectures like Hadoop and open source data mining and machine
learning projects is a big plus
No comments:
Post a Comment