We are looking for critical and creative thinkers to contribute to our team as a Data Engineer. You are a multi-talented engineer who can target high level problems and down and dirty details alike; in this role you will have a hand in every stage of development. You’re independent, comfortable thinking creatively with tools such as Python, SQL, AWS, Spark. You’re willing and able to learn new skills and technologies as the task at hand requires, and are adaptable to fluid and evolving project requirements.
In this role, you will contribute materially towards shaping and realizing the vision of our business; you will learn new technologies and tools, and expand your competence in multiple areas; you will contribute towards fundamentally changing the way an entire industry does business (really).
Work closely with clients to coordinate the transfer of data
Develop tools to extract and process client data from different sources, and tools to profile and validate data
Coordinate with data scientists to transform large amounts of data and store it in a format to facilitate modeling
Contribute to production operations, data pipelines, workflow management, and reliability engineering
We understand that no one is the complete package (but feel free to let us know if you are), so we’ve divided this role’s requirements into the basic qualifications we view as necessary for the job, and those skills that would be helpful, but that can be learned as you go if you don’t already possess familiarity.
BS/MS in CS, or in another quantitative discipline with equivalent experience and self-education
Fluency in a modern scripting language, like Python
Desire to learn new skills and tools (eg. Redshift, Spark, Tableau, etc)
Ability to pay close attention to detail
Skills that would be great but not required
Experience working with a cloud-computing environment (eg. Amazon AWS/EC2)
Comfortable developing shell scripts
A working knowledge of SQL and/or experience with non-relational/alternative databases
Machine learning and/or statistics coursework
Experience with a distributed computing platform/ecosystem (e.g. Hadoop)