csvbase is a simple website for sharing table data. Join the discord.

The 8,000 most-downloaded packages from PyPIapprox. 8,000 rows, last changed 1 week ago

Table of contents

  1. The basics: auth and content negotations
    1. Authentication
    2. Content negotiation
  2. The API: endpoint-by-endpoint
    1. Tables
      1. Reading a table
    2. Rows
      1. Creating a new row
      2. Reading a row
      3. Updating an existing row
      4. Deleting a row

The basics: auth and content negotiation

Authentication

With CSVBase, you authenticate using by putting your username and API key straight in the url (known as "HTTP "basic" auth").

Here's an example:

https://<some_user>:<some_api_key>@csvbase.com/calpaterson/top-pypi-packages-30-days

Basic auth is widely supported and is usually accepted anywhere that accepts urls.

However, calpaterson/top-pypi-packages-30-days is public so auth is needed only for writes.

Content negotiation

CSVBase APIs use content negotiation to decide what formats are in use. This means it consults HTTP headers to decide what format to send back in response to a request.

It important that you set the Content-Type and Accept headers to be the mimetype you want: typically that is application/json for both. If you fail to include these headers in your requests, the API will still work but CSVBase will pick a sensible default: CSV for tables, JSON for rows.

You can bypass content negotiation for read-only requests by appending a file extension to the url, eg .json. Here's an example of that (same resource as above):

https://<some_user>:<some_api_key>@csvbase.com/calpaterson/top-pypi-packages-30-days.json

This is useful when dealing with software where you aren't able to set headers.

The API: endpoint-by-endpoint

There are three kinds of thing in csvbase:

  1. users
  2. tables
  3. rows

While there's no API for users so far, there is for tables and rows.

Tables

This table looks like this in JSON:

{
    "name": "top-pypi-packages-30-days",
    "is_public": true,
    "caption": "The 8,000 most-downloaded packages from PyPI",
    "data_licence": "Unknown",
    "created": "2024-05-10T13:43:47.303655+01:00",
    "last_changed": "2024-09-01T12:31:14.596328+01:00",
    "columns": [
        {
            "name": "csvbase_row_id",
            "type": "integer"
        },
        {
            "name": "download_count",
            "type": "integer"
        },
        {
            "name": "project",
            "type": "string"
        }
    ],
    "approx_size": 8000,
    "page": {
        "rows": [
            {
                "row": {
                    "download_count": 1380415659,
                    "project": "boto3"
                },
                "row_id": 1,
                "url": "https://csvbase.com/calpaterson/top-pypi-packages-30-days/rows/1"
            }
        ],
        "previous_page_url": null,
        "next_page_url": "https://csvbase.com/calpaterson/top-pypi-packages-30-days?op=gt&n=1"
    }
}

Note that there is the top-level metadata, plus a "page" of rows. Tables are often (usually) too big to be put into a single JSON object so they are "paginated". To follow the table, page by page, you can use the next_page_url and previous_page_url dictionary keys. They will be null if you've reached the end or are at the beginning, respectively.

Reading a table

GET from https://csvbase.com/calpaterson/top-pypi-packages-30-days

You'll need to follow the next_page_url urls (described above) to get to the end of the table.

Rows

Rows from calpaterson/top-pypi-packages-30-days look like this in JSON:

{
    "row": {
        "download_count": 1380415659,
        "project": "boto3"
    },
    "row_id": 1,
    "url": "https://csvbase.com/calpaterson/top-pypi-packages-30-days/rows/1"
}

Creating a new row

POST to https://<some_user>:<some_api_key>@csvbase.com/calpaterson/top-pypi-packages-30-days/rows/

Example body
{
    "row": {
        "download_count": 1380415659,
        "project": "boto3"
    }
}
Example response
{
    "row": {
        "download_count": 1380415659,
        "project": "boto3"
    },
    "row_id": 1,
    "url": "https://csvbase.com/calpaterson/top-pypi-packages-30-days/rows/1"
}

Status code 201 upon success.

Reading a row

GET from https://csvbase.com/calpaterson/top-pypi-packages-30-days/rows/1

No body is provided with this request. Status code 200 upon success.

Example response
{
    "row": {
        "download_count": 1380415659,
        "project": "boto3"
    },
    "row_id": 1,
    "url": "https://csvbase.com/calpaterson/top-pypi-packages-30-days/rows/1"
}

Updating an existing row

PUT to https://<some_user>:<some_api_key>@csvbase.com/calpaterson/top-pypi-packages-30-days/rows/1

Example body
{
    "row": {
        "download_count": 1380415659,
        "project": "boto3"
    },
    "row_id": 1,
    "url": "https://csvbase.com/calpaterson/top-pypi-packages-30-days/rows/1"
}
Response

Upon success the body you sent will be echoed back, with status code 200.

Deleting a row

DELETE from https://<some_user>:<some_api_key>@csvbase.com/calpaterson/top-pypi-packages-30-days/rows/1

No body is required. Status code 204 upon success.