Job Description
Job Description
- 5+ years of hands-on experience in developing large scale applications using Cassandra (Apache or DataStax).
- Proficient in Cassandra APIs, tools to import variety of data formats like CSV, Excel, JSON, XML etc.
- Strong understanding of CQL, Data Modeling in-order to achieve highly performant data access.
- Strong experience in data modeling in Cassandra to design efficient storage model to meet variety of business needs
- Ability to troubleshoot & investigate stability, performance issues.
- Strong hands-on experience of programming with Java, Python, Scala etc.
- Experience with at least one clustered/distributed computing systems, HBase, Hadoop/MapReduce, Spark/SparkR, Lucene/ElasticSearch, Storm, Graph Databases etc
- Experience building data pipelines for structured/unstructured, real-time/batch, events/synchronous/asynchronous using MQ, Kafka, Steam processing.
- Experience with SQL, NoSQL, relational database design, and methods for efficiently retrieving data for Time Series Analytics.
- Experience gathering and processing raw data at scale (including writing scripts, web scraping, calling APIs, write SQL queries, etc.)
- Experience in Machine Learning, Deep Learning, Data Science, Apache Spark, Python
- Experience with Cloud architecture & service like AWS, Azure
- Experience with Graph, Semantic Web, RDF Technologies, SOLR or ElasticSearch
Job Responsibilities
Work - with solution architect & business analysts to define implementation design & coding of the assigned modules/responsibilities with the highest quality (bug-free).
- Determine technical approaches to be used, and defining the appropriate methodologies
- Must be capable of working in a collaborative, multi-site environment to support rapid development and delivery of results and capabilities (i.e. AGILE SDLC)
- Effectively communicating technical analyses, recommendations, status, and results to the project management team.
- Strong hands-on experience with Cassandra, CQL, data modeling, data replication, clustering, indexing for handling for large datasets.
- Expertise in managing Cassandra Cluster has Cassandra Admin using DataStax Enterprise Edition (preferred).
- Strong experience with at least one programming language like Java, Python, Scala.
- Experience working with distributed systems, clustering, and replication technologies.
- Well versed with development toolkits like Maven, GIT, Continues Integration suites, automated deployments, JIRA, Wiki.
Required Education Qualification:
- Bachelor's or Masters degree in Computer Science, Computer or Electrical Engineering, Mathematics, or a related field.