GraphQL apis for SWH
Closed, MigratedEdits Locked
Actions

Assigned To

Authored By

	jayeshv
	Jun 23 2021, 5:03 PM

Description

REST APIs are great; but, at times, it is too hard to write clients just using resource based CRUD operations. It is especially true for a system like SWH with a lot of centralized relational data. A GraphQL API layer could be a good addition to make data retrieval easy and efficient.
A graph API will let a client application to get the exact amount of data it requires in a single request. Requested attributes could be part of different resources from a REST point of view.

Most of the SWH APIs are read only. This will make the implementation of the graph layer particularly easy, we only have to think about a schema and mostly forget mutations.
GraphQL will also reduce the complexity of versioning the REST APIs. This layer can be added without disturbing any of the existing API contracts. It would also be possible for REST APIs to evolve independently.

One example use case:
A third party user is creating a data dashboard for showing all the projects newly archived in 2021.
With the current APIs, it will take many back and forth trips to gather this related data to populate the client's page.
1: To get all the origins.
2: Get visits for each origin to identify the archived date. (It might be possible to filter by an archived date in the previous request, haven't found that in the docs)
3: Get the metadata for the origin for further information.
This could be a difficult and time taking task to code in the client. This will also cause unnecessary load in the server.
With a GraphQL query, it would be possible to gather all this related data (depending on the published schema) in a single request.
This will also avoid complex client side validations (they get what they requested for) and make any JSON unmarshalling easier.

A few possible implementation ideas are
1: As a new package in SWH-web.
A graph endpoint can be exposed in the current Django system. Existing code used by the REST can be used to gather data for the graph API.
Major disadvantages are:

It will make the already big SWH-Web bigger.
Complexity for gathering data is mostly in the storage project. So the possibility of code re-use is actually minimal.

2: As a new service using existing REST APIs.
A new service that in turn calls the existing APIs to gather data. This could be useful in case we have to split the existing APIs to multiple micro services in the future.
Some disadvantages are:

This will be dependent a lot on the existing API. We will be forced to add a REST API before even exposing something in GraphQL schema.
REST could be an unnecessary level for those using graph APIs.
Both APIs may have to go through the same auth, throttling checks.

3: As a new service with direct calls to other services.
This way, we can have an independent Graph API layer. This layer would be free to call other services or the existing APIs to gather data.
One possible problem is that going forward, resources and attributes in REST and Graph APIs might differ a bit.

The 3rd approach seems to be the best considering the current code base.

related to T1805

Related Objects
Search...

Status	Assigned	Task
Migrated	gitlab-migration	T4083 New public API (GraphQL + thin layer)
Migrated	gitlab-migration	T3405 GraphQL apis for SWH
Migrated	gitlab-migration	T3556 Implement a generic pagination support for the GraphQL response.
Migrated	gitlab-migration	T3932 Define a GraphQL schema
Migrated	gitlab-migration	T3933 Decide on the libraries to use for GraphQL server
Migrated	gitlab-migration	T3984 Structure and design for swh-graphql
Migrated	gitlab-migration	T4103 Setup a staging environment for GraphQL APIs
Migrated	gitlab-migration	T4135 staging: Deploy graphql service
Migrated	gitlab-migration	T4413 Deploy argocd on admin vlan

Event Timeline

jayeshv created this task.Jun 23 2021, 5:03 PM

vlorentz triaged this task as Normal priority.Jun 23 2021, 5:20 PM

jayeshv claimed this task.Jun 29 2021, 3:57 PM

I stumbled across GitLab GraphQL API while working on T3442, could be a great source of inspiration.

In T3405#67656, @anlambert wrote:

I stumbled across GitLab GraphQL API while working on T3442, could be a great source of inspiration.

Thanks @anlambert . I somehow missed your comment. I will have a look at the docs.

Since we're listing GraphQL APIs for forges, SourceHut has a brand new one too: https://man.sr.ht/graphql.md (especially git.sr.ht)

And it's implemented in Python

jayeshv updated the task description. (Show Details)Feb 14 2022, 2:50 PM

jayeshv edited projects, added GraphQL API; removed Web app.Mar 28 2022, 3:58 PM

jayeshv changed the status of subtask T3556: Implement a generic pagination support for the GraphQL response. from Open to Work in Progress.Mar 28 2022, 4:01 PM

jayeshv changed the status of subtask T3932: Define a GraphQL schema from Open to Work in Progress.

jayeshv closed subtask T3933: Decide on the libraries to use for GraphQL server as Resolved.

jayeshv changed the status of subtask T3984: Structure and design for swh-graphql from Open to Work in Progress.

jayeshv added a parent task: T4083: New public API (GraphQL + thin layer).May 16 2022, 10:28 AM

jayeshv closed subtask T3984: Structure and design for swh-graphql as Resolved.May 31 2022, 5:47 PM

jayeshv closed subtask T3932: Define a GraphQL schema as Resolved.

jayeshv closed subtask T3556: Implement a generic pagination support for the GraphQL response. as Resolved.

ardumont closed subtask T4103: Setup a staging environment for GraphQL APIs as Resolved.Sep 9 2022, 2:48 PM

gitlab-migration changed the status of subtask T3556: Implement a generic pagination support for the GraphQL response. from Resolved to Migrated.Jan 8 2023, 4:35 PM

gitlab-migration changed the status of subtask T3932: Define a GraphQL schema from Resolved to Migrated.

gitlab-migration changed the status of subtask T3933: Decide on the libraries to use for GraphQL server from Resolved to Migrated.

gitlab-migration changed the status of subtask T3984: Structure and design for swh-graphql from Resolved to Migrated.

gitlab-migration changed the status of subtask T4103: Setup a staging environment for GraphQL APIs from Resolved to Migrated.

This task has been migrated to GitLab.

GraphQL apis for SWHClosed, MigratedEdits LockedActions

Description

Related ObjectsSearch...

Event Timeline

GraphQL apis for SWH
Closed, MigratedEdits Locked
Actions

Related Objects
Search...