Senior DevOps / Crawl Engineer

Hanzo Archives is a cutting-edge web archiving company. Global corporations use our products and services to capture, archive, preserve, and make discoverable web-based electronically stored information (ESI) in native format. Their needs are driven primarily by eDiscovery, information governance, and heritage requirements. Our operations are based in Europe and the USA.

Hanzo has implemented the entire technology stack required to capture and archive the modern web with a sophisticated crawler at its core. This job is at the heart of crawler operations: to configure and manage crawls, process archived data, and interact with customers. We call this “Crawl Engineering”.

Also known as:

  • DevOps with Front-End Debugging
  • Web Archive Operations Engineer

We are looking for bright, enthusiastic, self-motivated, self-learning Senior Crawl Engineers. Candidates must have strong diagnostic skills, be able to hold their own in a busy and challenging environment, and thrive on learning about and optimising operations and systems. Candidates must demonstrate experience in Python and JavaScript, a comprehensive knowledge of the workings of the web, solid Unix / Linux skills, and scripting with command-line tools such as find, grep, and awk.
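
The day-to-day work is easiest to convey with an example. The following Python sketch shows the kind of small diagnostic script a Crawl Engineer might write: summarising HTTP status codes from a crawl log. It is purely illustrative; the Apache-style log format and the regular expression are assumptions, not a description of Hanzo's systems.

    import re
    import sys
    from collections import Counter

    # Purely illustrative: summarise HTTP status codes in a crawl log.
    # Assumes an Apache-style common log format; Hanzo's actual log
    # formats and tooling are not described here.
    STATUS_RE = re.compile(r'" (\d{3}) ')  # status code follows the quoted request

    def status_summary(path):
        counts = Counter()
        with open(path) as log:
            for line in log:
                match = STATUS_RE.search(line)
                if match:
                    counts[match.group(1)] += 1
        return counts

    if __name__ == "__main__":
        for status, count in status_summary(sys.argv[1]).most_common():
            print(f"{status}\t{count}")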

  • Salary: Negotiable base salary plus participation in share options scheme
  • Location: Home-based, or office-based in Edinburgh, UK, or on the East Coast of the USA
  • To find out more or apply for this job, please email an intro plus your CV to Shuba Rao at shuba.rao@hanzoarchives.com.

Job Description

About the Company

Hanzo Archives is a cutting-edge web archiving company. Global corporations use our products and services to capture, archive, preserve, and make discoverable web-based electronically stored information (ESI) in native format. Their needs are driven primarily by eDiscovery, information governance, and heritage requirements. Our customers are some of the largest and most successful corporations in their industries. We currently operate in Europe and the USA.

Hanzo has implemented the entire technology stack required to archive the modern web, with a sophisticated crawler at its core. This job is at the heart of crawler operations: to configure and manage crawls, process archived data, and interact with customers. We call this “Crawl Engineering”.

Job Summary

Reporting to the Head of Development, the Senior Crawl Engineer will primarily drive the technical aspects of the archiving operations for customers and ensure that we continue to deliver innovative and high-quality services. This will include writing software and tools to support these tasks; running large, distributed, long-running jobs; instrumentation and metrics gathering; processing large volumes of data; and managing virtual infrastructure (machines and storage).
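
As a rough illustration of the instrumentation side of the role, the Python sketch below times a long-running step and logs the elapsed time as a metric. The metric name and the stand-in workload are hypothetical; it shows the shape of the task, not Hanzo's actual tooling.

    import logging
    import time
    from contextlib import contextmanager

    logging.basicConfig(level=logging.INFO, format="%(asctime)s %(message)s")

    # Hypothetical sketch: wall-clock instrumentation around a long-running
    # crawl step. The metric name below is illustrative only.
    @contextmanager
    def timed(metric_name):
        start = time.monotonic()
        try:
            yield
        finally:
            elapsed = time.monotonic() - start
            logging.info("metric=%s elapsed_s=%.2f", metric_name, elapsed)

    if __name__ == "__main__":
        with timed("crawl.fetch_batch"):
            time.sleep(0.5)  # stand-in for fetching a batch of pages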

Roles and Responsibilities:

  • Run crawler operations, including configuring crawls, building probers, and diagnosing and resolving issues
  • Work within our process, which includes monitoring SLAs and updating our issue-tracking system
  • Translate feedback from customers and operations into software development to enhance our product and service offerings
  • Maintain and enhance existing software (both internal products and our open source projects)
  • Communicate systematically and at the right time
  • Work proactively, enthusiastically seeking out problems in the software and systems and finding solutions
  • Be responsible for completing time-critical day-to-day tasks
  • Solve problems independently and as a team

Skills and Abilities Required for the Role:

  • Diagnose technical problems effectively
  • Work in a startup environment, taking on varied and sometimes disparate tasks that need to be completed in a timely manner
  • Document software rigorously
  • Work with and without supervision
  • Solve problems and think laterally, both individually and as part of a team
  • Communicate clearly, offering or asking for advice when needed
  • Actively seek out problems and find solutions
  • Work remotely and with geographically dispersed teams

Person Specification:

All candidates must be able to demonstrate the following essential personal attributes.

  • Willing to firefight
  • Write quality code
  • Understand and work with other people’s code
  • Solve technical and operational problems
  • Python and JavaScript
  • Regular expressions
  • Unix / Linux, including scripting with tools such as grep, find, and awk
  • In-depth understanding of HTTP and the web
  • Write clearly
  • Responsible and self-motivated
  • Eager to learn, teach, and solve problems
