Sub-processors

A sub-processor is a third-party service that processes data on behalf of 2i2c while we operate infrastructure for a community. This page describes the sub-processors used in a typical hub deployment and the kinds of personal data each one handles. The specifics differ per hub (e.g., the cloud region where data is stored, or any add-on services), so the list below is not exhaustive. See "How to find the data processors for a specific hub" below to determine the list for a particular hub.

This is not a legal document

This document is provided to set expectations and understanding about 2i2c’s cloud infrastructure service. It is not legally binding.

Typical sub-processors

The services below are common touchpoints for user data (e.g., a user's identity or the files they create).

Cloud or infrastructure provider

An infrastructure provider (like a cloud provider) hosts the core infrastructure that runs the hub. This includes compute, storage, and logging. The cloud region is configured per hub and determines where this data is stored. 2i2c either manages its own cloud provider account, or uses one managed by the community.

Personal data processed by this sub-processor:

  • User home directory contents (e.g., notebooks, data files created during use)

  • The JupyterHub username / authentication database

  • System and access logs

Here are a few common cloud providers we use and their sub-processors pages:

Identity provider

An identity provider is the service used to authenticate users when they log in. The hub uses these providers to authenticate users but does not store authentication credentials on them. The user or their home institution has a direct relationship with the identity provider.

Personal data processed by this sub-processor (exchanged at login only):

  • Username

  • Email address (depending on the provider)

  • Membership in a designated organization (where applicable)

Here are a few common ones we use:

Note: If a hub uses CILogon, the user’s home institution is itself the identity provider and effectively the relevant sub-processor for that user’s authentication data.

How to find the data processors for a specific hub

The sub-processors used by a given hub are determined by its configuration in the 2i2c infrastructure repository. Identifying them usually means reading through the hub's .yaml configuration files to find the services that the hub uses. If you need this information, please open a support ticket.
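For readers who want a rough idea of what reading through the configuration might look like, here is a minimal sketch in Python. It scans YAML text for a few keys that could hint at the cloud provider, region, and identity provider. The key names (`provider`, `region`, `authenticator_class`), the `find_hints` helper, and the example configuration are all assumptions made for illustration; this page does not confirm them, and the actual layout of the infrastructure repository may differ.

```python
import re

# Keys assumed (not confirmed by this page) to hint at a hub's sub-processors.
HINT_KEYS = ("provider", "region", "authenticator_class")

# Matches simple "key: value" lines at any indentation.
# Lines that only open a nested mapping (e.g. "gcp:") do not match.
LINE_RE = re.compile(r"^\s*(?:-\s*)?([A-Za-z_][\w.-]*)\s*:\s*(\S.*?)\s*$")


def find_hints(yaml_text, keys=HINT_KEYS):
    """Return {key: sorted unique values} for the hint keys found in the text."""
    found = {}
    for raw_line in yaml_text.splitlines():
        line = raw_line.split(" #", 1)[0]  # drop trailing comments
        match = LINE_RE.match(line)
        if not match:
            continue
        key, value = match.groups()
        if key in keys:
            found.setdefault(key, set()).add(value.strip("'\""))
    return {key: sorted(values) for key, values in found.items()}


# Hypothetical configuration, invented for illustration only.
EXAMPLE = """\
name: example-cluster
provider: gcp  # cloud provider hosting the hub
gcp:
  region: us-central1
jupyterhub:
  hub:
    config:
      JupyterHub:
        authenticator_class: cilogon
"""

hints = find_hints(EXAMPLE)
for key, values in hints.items():
    print(f"{key}: {', '.join(values)}")
```

This is a line-based scan rather than a full YAML parse, so it only surfaces candidate values to check by hand. A support ticket remains the reliable way to get the list for a particular hub.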