Senior Data Scientist | Machine Learning | Big Data | Text Mining

ClustrData / Tally Analytics, Bangalore · clustr.co.in · Full-time employment · Programming

Who we are:

Clustr is building a first-of-its-kind analytics platform to bring the benefits of Big Data to small and medium enterprises (SMEs).

For example, whether it’s a small grocery store trying to reduce dead stock, a manufacturer looking to streamline its supply chain, or a wholesaler planning to expand to a new location, the Clustr platform will provide them with highly contextual, data-driven insights to help them make better commercial and business decisions.

Clustr is a subsidiary of Tally Solutions Pvt Ltd, a name that has become synonymous with ERP software for SMEs. Thus, in its crucial teething phase, Clustr is blessed to have right next to it a repository of domain knowledge, a stable source of funding, and a source of unique data!

What we are looking for:

  • A habit of going on dates with data. A fondness for dancing with math.
  • Solid track record of building data products; 7 to 12 years of hands-on experience in data science and machine learning.
  • Courage to take on novel data science problems that no one has attempted before.
  • Penchant for hacking. For example, diving into the guts of a scikit-learn method and modifying the code to suit the unique problem at hand.
  • Ability to quickly translate a business problem into a data science problem and propose numerous potential solutions to it.
  • Desire to invent new algorithms, not just use and assemble existing techniques / algorithms.
  • Ability to mentor and coordinate a team of smart and aspiring data scientists.
  • At least one of the following:
    • Broad and deep experience working with text data. For example: NLU, query formation, named entity recognition, chatbot building, context-aware semantic search, multilingual indexing.
    • Strong foundations in statistics and experience in deriving / validating deep statistical inferences. For example: how do you determine if the data used to arrive at a statistical inference is adequate or even representative, when you don't know the underlying distribution of the data? 
  • Code ninja (or Yoda, if you prefer) in at least two of the following languages and tools: Python, R, Java, Weka, C++. You will spend more than 50% of your time on hands-on coding.
  • Experience in working with graph data (e.g. Big Graph Mining) is a plus.
  • Basic knowledge of the Big Data stack is a plus: Spark, Hadoop, Kafka, Cassandra, Scala, Akka, ...

What you can look forward to:

  • A chance to deploy your unique skills and expertise to impact the backbone of emerging economies around the world.
  • The experience of building unique analytics products on an untapped gold mine of data, and picking up rare data science skills along the way.
  • A gang of smart, high-energy, self-driven colleagues who are always up for a passionate debate.
  • The galvanising vibes of a small company without the instabilities of a start-up. No distractions related to funding.
  • Competitive salary.

If that sounds interesting, let's talk!

Job Perks

Free cab to/from office.

Subsidized breakfast, lunch, evening snacks.

Many more...

It is NOT OK for recruiters, HR consultants, and other intermediaries to contact this employer