Skip to main content

Job Opportunities at the Internet Archive

Senior Engineer: Development Operations

About the Internet Archive: Internet Archive is a 501(c)(3) non-profit library founded in 1996. Our motto is Universal Access to All Knowledge. We collect web sites, books, audio, videos, software, and other types of media and make them available to the world for free. Based in San Francisco and with satellites around the world, the Internet Archive staffers are building the digital library of the future--a place where we can all go to learn and explore. Our 140 engineers, book scanners, librarians, designers and team members have built the #250 website in the world, https://archive.org.

Location: San Francisco, CA

Job Classification: Full-time, exempt

Job Summary: The Internet Archive has a huge corpus of digital information - all running across and integrated cluster of over 700 VMs on 500+ "bare-metal" hosts. Every day, our team of development engineers creates tools and applications to work with 20 petabytes of content replicated over 3 data centers (50 petabytes raw storage total). This includes millions of books and texts, millions of hours of video, millions of audio tracks, and over 450 billion web captures. We are looking for a smart engineer with deep experience in Development Operations and a passion for modern and evolving technologies and approaches to compute/data cluster operations and software development environments and processes. The ideal candidate will have demonstrated effective approaches to the security, reliability, and performance of large/complicated web applications or services, and will work collaboratively with talented engineering colleagues.

Key Responsibilities: The responsibilities of this position are to be the technical master of all things related to development environments, software and tool evolution, cluster health and data security at the Archive. More specifically, this means:

  • Deploy, administer, and tune tools that support the software development infrastructure
  • Research, assess, select, and deploy tools to support and enhance the aggregate development environment
  • Train newly hired software development engineers in organizational-specific processes and tools as necessary
  • Define, communicate, and enforce the process of software transition from development into production status.
  • Be the "gatekeeper" of the transition of executable code from development into production
  • Analyze current challenges in our development and operational process and infrastructure(s) and propose, validate, and implement effective improvements
  • Participate in monitoring, maintain and (when necessary) restoring the health of the storage and compute cluster

Minimum Qualifications:

  • Fluency in Linux environments (at least 5 years experience)
  • Experience working in and administering cluster-computing environments
  • Scripting and/or programming skills and the ability to develop custom tool integrations
  • Expertise in configuration and management of software development environments and cluster administration tools, including Git-Hub, SVN, BitBucket, Docker, Ansible, Zookeeper, Monitoring Tools: Nagios, Graphite, Kibana etc.
  • Experience maintaining bug tracking, code review, coverage analysis, agile tracking and testing tools
  • Experience deploying and administering database, search, and web-host services.
  • Broad knowledge of build, integration, deployment and provisioning
  • Experience open source practices and participation in open source forums
  • Takes pride in keeping current with industry discoveries and evolutions
  • Networking: basic TCP/IP and UDP understanding. Experience with load balancers, web and DNS servers, certificates, etc. Able to troubleshoot networking and web services issues using nginx and Apache logs as well as network packet captures.
  • Flexibility and a sense of humor
  • BS Computer Science, or equivalent work experience

Preferred Qualifications:

  • Specific experience with Atlassian tool sets
  • Good knowledge of evolving database and search technologies, including Redis-3, Postgres, ElasticSearch, SPARK, and SOLR
  • MS in Computer Science or equivalent work experience
  • Habitually explore new tools and ideas and incorporate them into your everyday work

Reporting Structure:The Senior Engineer for Development Operations reports to the Director of Engineering and will work closely with both the cluster infrastructure team and web application development teams. The entire staff is guided by founder and Digital Librarian, Brewster Kahle. Like many Bay Area technology firms, we offer staffers a catered lunch every Friday, massages, espresso bar, endless M&Ms and working in the coolest office space in SF. But more importantly, you will be helping to make sure we bring out cultural treasures into the digital age, while preserving our present and future. Read more about the Internet Archive in Medium and The New Yorker

.

To Apply: Please send your resume and cover letter to Jobs+SeniorOpsEngineer@archive.org with the subject line "COA-301: Senior Development Operations Engineer"

Internet Archive is an Equal Opportunity Employer. Internet Archive complies with the Fair Chance Ordinance. Internet Archive is a 501(c)(3) non-profit library founded in 1996.

SE-303: Senior Engineer: Search and Data Management

About the Internet Archive: Internet Archive is a 501(c)(3) non-profit library founded in 1996. Our motto is Universal Access to All Knowledge. We collect web sites, books, audio, videos, software, and other types of media and make them available to the world for free. Based in San Francisco and with satellites around the world, the Internet Archive staffers are building the digital library of the future--a place where we can all go to learn and explore. Our 140 engineers, book scanners, librarians, designers and team members have built the #250 website in the world, https://archive.org.

Location: San Francisco, CA

Job Classification: Full-time, exempt

Job Summary: The Internet Archive has a huge corpus of digital information - all ready and waiting for access through modern Search technologies. With millions of books and texts, 100s of 1000s of hours of video, millions of audio tracks, and over 450 billion web captures there's a lot to search. We are looking for a smart engineer with deep experience in Search and a passion for modern and evolving search approaches and technologies. The ideal candidate will possess a desire to work collaboratively with a small internal team and a large, vocal and active user community; demonstrating independence, creativity, initiative and technological savvy, in addition to being a great programmer/architect.

Key Responsibilities: The responsibilities of this position are to be the technical master of all things related to Search at the Archive. More specifically, this means:

  • Improving the Search experience at Archive.org, OpenLibrary.org, and Archive-It.org
  • Ensuring the effective deployment, configuration, and administration of Search technologies within our multi-data-center computation and storage cluster to achieve outstanding search performance and quality.
  • Analyzing current challenges in our search infrastructure(s) and proposing effective improvements.
  • Facilitate the effective integration of Search into multiple web applications as they evolve.

Minimum Qualifications: 3 to 5 years extensive work experience in open-source search systems and technologies, including:

  • ElasticSearch
  • SOLR / Lucene
  • Redis
  • At least 10 years experience working in Linux environments
  • Experience working in cluster-computing environments
  • Knowledge and experience with search optimization an user experience considerations (for both text-centric and metadata-centric searching)
  • Flexibility and a sense of humor
  • BS Computer Science, or equivalent work experience

Preferred Qualifications:

  • Deep knowledge of evolving search technologies, including SPARK, SOLR5, and REdis-3
  • Cluster computing experience
  • Experience open source practices and participation in open source forums
  • MS in Computer Science or equivalent work experience

Reporting Structure:The Senior Engineer for Search reports to the Director of Engineering and will work closely with both the cluster infrastructure team and web application development teams. Like other Bay Area technology firms, we offer staffers a catered lunch every Friday, massages, countless M&M's and working in the coolest office space in San Francisco. But more importantly, you will be helping to make sure we preserve the history of the Digital Age.

To Apply: Please send your resume and cover letter to Jobs+SeniorSearchEngineer@archive.org with the subject line "SE-303: Search Senior Engineer"

Internet Archive is an Equal Opportunity Employer. Internet Archive complies with the Fair Chance Ordinance.

Wayback Machine Senior Engineer

About the Internet Archive: Internet Archive is a 501(c)(3) non-profit library founded in 1996. Our motto is Universal Access to All Knowledge. We collect web sites, books, audio, videos, software, and other types of media and make them available to the world for free. Based in San Francisco and with satellites around the world, the Internet Archive staffers are building the digital library of the future--a place where we can all go to learn and explore. Our 140 engineers, book scanners, librarians, designers and team members have built the #250 website in the world, https://archive.org.

Location: San Francisco, CA

Job Classification: Full-time, exempt

Job Summary: The Internet Archive's Wayback Machine is the world's largest public archive of historical web sites. Have you ever wanted to work with 450 billion things at once? Would you like to serve 1,500 requests per second? How about having your service referred to regularly in news articles and blog posts across the web? You can work on a challenging and popular project and help the world at the same time.

We are looking for a smart, collaborative and resourceful engineer to help develop the next version of the Wayback Machine. The ideal candidate will possess a desire to work collaboratively with a small internal team and a large, vocal and active user community; demonstrating independence, creativity, initiative and technological savvy, in addition to being a great programmer/architect.

Minimum Qualifications:

  • 2-3 years work experience in Python, or similar
  • Experience working in Linux environments
  • Familiarity with Java (current deployment is written in Java)
  • Good understanding of latest web framework technologies and aspects of web technology and protocols
  • Flexibility and a sense of humor
  • BS Computer Science, or equivalent work experience

Preferred Qualifications:

  • Experience with web crawlers and/or applications designed to display archived web content (especially server-side apps)
  • Cluster computing experience
  • Open source practices experience

To apply: Please send your resume and cover letter to Jobs+SeniorWaybackEngineer@archive.org with the subject line "Wayback Machine Senior Engineer."

Internet Archive is an Equal Opportunity Employer. Internet Archive complies with the Fair Chance Ordinance.

Designer

About the Internet Archive: Internet Archive is a 501(c)(3) non-profit library founded in 1996. Our motto is Universal Access to All Knowledge. We collect web sites, books, audio, videos, software, and other types of media and make them available to the world for free. Based in San Francisco and with satellites around the world, the Internet Archive staffers are building the digital library of the future--a place where we can all go to learn and explore. Our 140 engineers, book scanners, librarians, designers and team members have built the #250 website in the world, https://archive.org.

Millions of people use https://archive.org every day, and we would like to make sure they have the best experience possible.

As in-house designer, your primary responsibility will be to drive the design process and execution for the redesign of Archive.org and additional projects. In this capacity, you will report to the Director of Web Services and will additionally work under the guidance of an Advisory Creative Director.

This position has an expected duration of 3-6 months.

Responsibilities

  • Liaise with Advisory Creative Director
  • Produce iterative wireframes for UI design and lead ongoing prototype development
  • Collaborate with engineering team to understand technical needs and limitations
  • Produce iterative mockups for new visual & detail design
  • Produce all final mockups and relevant files for full implementation of final redesign
  • Lead and execute on evaluative research process, including user testing
  • Understand post-launch feedback and follow up with ongoing adjustments as needed
  • Develop recommendations for interaction model and overall redesign strategy

To Apply: Please send your resume and cover letter to Jobs+Designer@archive.org with the subject line "Designer."

Internet Archive is an Equal Opportunity Employer. Internet Archive complies with the Fair Chance Ordinance.

CO-101: Cluster Operations H/W Technician

About the Internet Archive: The Internet Archive is a non-profit with a huge mission: to give everyone access to all knowledge—the books, web pages, audio, television and software of our shared human culture. Forever. Based in San Francisco and with satellites around the world, the Internet Archive staffers are building the digital library of the future--a place where anyone can go to learn and explore. Our 140 engineers, book scanners, librarians, designers and team members have built one of the top websites in the world. (https://archive.org).

Location: San Francisco, CA and the City of Richmond, CA

Job Classification: Full-time, exempt

Job Summary: The Internet Archive has a huge corpus of digital information - all running across an integrated cluster of over 700 VMs on 500+ "bare-metal" hosts. Every day, our team of development engineers creates tools and applications to work with 20 petabytes of content replicated over 3 data centers (50 petabytes raw storage total). This includes millions of books and texts, millions of hours of video, millions of audio tracks, and over 450 billion web captures. We are looking for a methodical and detail-oriented individual with experience in server hardware maintenance and administration. The ideal candidate will have demonstrated reliable and effective work habits supporting the security, reliability, and performance of large server-farm installations, will also have experience in office IT systems, and will work collaboratively with talented colleagues.

Key Responsibilities: The responsibilities of this position are to be the technical master of all things related to hardware systems that support the organizational infrastructure, development environments, cluster health and data security at the Archive. More specifically, this means working within the Cluster Operations team, rotating across multiple assignments and technical tasks:

  • Disk Swaps: Work directly with the team to maintain the healthy operation of our storage cluster with over 20,000 individual high-capacity disk drives across 3 data-center locations. Replace drives as necessary and re-initiate storage availability to the system.
  • Server maintenance and repair: Perform module-level troubleshooting and replacement for high-performance servers within the cluster.
  • Rack repair and maintenance: Perform module-level troubleshooting and replacement for racked hardware systems. Troubleshoot and replace power supplies, boards, and network interfaces.
  • Maintain, troubleshoot and correct wired and wireless network connectivity.
  • As necessary, support laptop and desktop workstations throughout the offices.
  • Use judgment to prioritize tasks are initiated through a job ticketing systems. The specifics of each task must we well-documented back into the job ticketing system once completed.
  • Generally, participate in monitoring, maintaining and (when necessary) restoring the health of the storage and compute cluster.

Minimum Qualifications:

  • Comfort and familiarity with Linux operating system
  • Strong attention to detail
  • Honest, reliable self-starter, able to complete a list of task with little supervision.
  • Familiar with the organization of computer system internal modules (power module, memory modules, etc).
  • Familiarity with basic hand tools and manual dexterity to perform hardware maintenance on computer systems.
  • Familiarity with Mac-OS and Windows operating systems
  • Good understanding of basic networking technologies, hardware, and protocols.
  • Understanding of static and magnetic protocols for electronic system assembly/disassembly
  • Driver’s License and ability to drive (company-owned vehicles).
  • Strong verbal and written communication skills.
  • Comfortable using electronic means of communication with others (instant messaging, email, etc.)
  • Enthusiastic to learn new procedures and skills
  • Flexibility and a sense of humor

Preferred Qualifications:

  • Specific experience with Atlassian Jira
  • Passion for tidiness and order
  • Able to write process documentation.
  • Able to collaborate with others on the team

Reporting Structure:Cluster Operations Hardware Technician reports to the Director of Engineering and will work closely with closely with the cluster infrastructure team. The entire staff is guided by founder and Digital Librarian, Brewster Kahle. Like many Bay Area technology firms, we offer staffers a catered lunch every Friday, massages, espresso bar, endless M&Ms and working in the coolest office space in SF. But more importantly, you will be helping to make sure we bring out cultural treasures into the digital age, while preserving our present and future. Read more about the Internet Archive in Medium and The New Yorker

.

To Apply: Please send your resume and cover letter to Jobs+ClusterOperationsHWTechnician@archive.org with the subject line "CO-101: Cluster Operations Hardware Technician"

Internet Archive will consider for employment-qualified applicants with criminal histories in a manner consistent with the requirements of the Fair Chance Ordinance.

Work Transition Program Manager

About the Internet Archive: Internet Archive is a 501(c)(3) non-profit library founded in 1996. Our motto is Universal Access to All Knowledge. We collect web sites, books, audio, videos, software, and other types of media and make them available to the world for free. Based in San Francisco and with satellites around the world, the Internet Archive staffers are building the digital library of the future--a place where we can all go to learn and explore. Our 140 engineers, book scanners, librarians, designers and team members have built the #250 website in the world, https://archive.org.

Location: Richmond, CA.

The location of the program is in a safe; secure building owned by the Internet Archive in Richmond, California. It is within 1 mile of the Richmond Bart Station and is also 100 yards from a Bus Stop.

Job Classification: Full-time, exempt

Job Summary: Join a passionate, dedicated and bright team who are building a global digital library for the entire world. If you've ever wanted to come to work and feel that your efforts directly make an impact on both your local and the global community, then this position might be for you!

We are not looking for just anyone, we are looking for that special person. Maybe that person is you! We are looking for someone to head up our Work Transition Program.

The Work Transition Program takes formerly hard-to-employ individuals who have completed a 12-month clean and sober program by helping them develop a work history. This 9-month re-learn-to-work program held at the Internet Archive in Richmond, California will teach motivated and eager individuals the skills, habits and behaviors necessary for the contemporary work environment. Successful graduates of the Work Transition Program interview and in most cases earn jobs at national employers such as Costco and UPS, along with local organizations such as the San Francisco Public Library.

We are looking for an outstanding person to be both a manager and mentor to the participants, along with being a key part of the management team at the Internet Archive. This individual should be passionate about helping and developing people. We are looking for a compassionate individual who can dole out tough love when necessary, but also be a humanist who believes that hard working people deserve a second chance. This person should believe that with the right support, leadership, and motivation, individuals can move their life forward and contribute to their families and their communities.

There are three phases to the Work Transition Program, each with increasing responsibility and skills associated with the upward ladder of building a work history. The right person hired for this position will be part of a management team, and lead up to 40 full-time employees. This position utilizes a variety of training modules and production (white/blue collar) work projects related to their personal and professional development. Currently there are 7 participants in the first phase. Additional people will be added every 3-4 months.

The Work Transition Program is a three-year pilot program that was started in 2014. Following the 3-year pilot, the program is anticipated to scale up rapidly. The Work Transition Program is a partnership between the Bay Area Rescue Mission (a 501c3 not-for-profit), the Internet Archive (a 501c3 not-for-profit) and national employers such as Costco, UPS and libraries such as the San Francisco Public Library.

Preferred Qualifications:

  • Experience working with recovering alcoholics, drug addicts and/or victims of domestic abuse and/or an individual who knows they are good at helping develop and mentor people.
  • A team leader who has demonstrated that he/she can build, train and motivate teams of hourly workers in a production environment.
  • Above average knowledge of computers; excel spreadsheets and web search skills a must.
  • Experience with digital photography, image files, books or libraries a plus, but not mandatory.
  • Social workers, addiction specialists, and individuals who come from a warehouse, factory, or retail background are encouraged to apply.
  • The person should enjoy and be comfortable with individuals at the later stage of their recovery who are eager and ready to re-enter the community.
  • BA/BS or equivalent experience.

To Apply: Please send your resume and cover letter to Jobs+WorkTransitionProgramManager@archive.org with the subject line "Work Transition Program Manager."

Internet Archive is an Equal Opportunity Employer. Internet Archive complies with the Fair Chance Ordinance.