nova-api/api/transfer.py

"""Does quite a few checks and prepares the incoming request for the target endpoint, so it can be streamed"""

import json
import yaml
import fastapi

from dotenv import load_dotenv

import streaming
import moderation

from db import users
from helpers import tokens, errors

load_dotenv()

models_list = json.load(open('models.json'))

with open('config/credits.yml', encoding='utf8') as f:
    credits_config = yaml.safe_load(f)

async def handle(incoming_request):
    """Transfer a streaming response from the incoming request to the target endpoint"""

    path = incoming_request.url.path.replace('v1/v1/', 'v1/')

    # METHOD
    if incoming_request.method not in ['GET', 'POST', 'PUT', 'DELETE', 'PATCH']:
        return await errors.error(405, f'Method "{incoming_request.method}" is not allowed.', 'Change the request method to the correct one.')

    # PAYLOAD
    try:
        payload = await incoming_request.json()
    except json.decoder.JSONDecodeError:
        payload = {}

    # Tokenise w/ tiktoken
    try:
        input_tokens = await tokens.count_for_messages(payload['messages'])
    except (KeyError, TypeError):
        input_tokens = 0

    # Check user auth
    received_key = incoming_request.headers.get('Authorization')

    if not received_key:
        return await errors.error(401, 'No NovaAI API key given!', 'Add "Authorization: Bearer nv-..." to your request headers.')

    if received_key.startswith('Bearer '):
        received_key = received_key.split('Bearer ')[1]

    user = await users.by_api_key(received_key.strip())

    if not user:
        return await errors.error(401, 'Invalid NovaAI API key!', 'Create a new NovaOSS API key.')

    ban_reason = user['status']['ban_reason']
    if ban_reason:
        return await errors.error(403, f'Your NovaAI account has been banned. Reason: "{ban_reason}".', 'Contact the staff for an appeal.')

    if not user['status']['active']:
        return await errors.error(418, 'Your NovaAI account is not active (paused).', 'Simply re-activate your account using a Discord command or the web panel.')

    if '/models' in path:
        return fastapi.responses.JSONResponse(content=models_list)

    # Calculate cost of tokens & check for nsfw prompts
    costs = credits_config['costs']
    cost = costs['other']

    policy_violation = False

    if 'chat/completions' in path:
        for model_name, model_cost in costs['chat-models'].items():
            if model_name in payload['model']:
                cost = model_cost

        policy_violation = await moderation.is_policy_violated(payload['messages'])

    elif '/moderations' in path:
        pass

    else:
        inp = payload.get('input', payload.get('prompt'))

        if inp:
            if len(inp) > 2 and not inp.isnumeric():
                policy_violation = await moderation.is_policy_violated(inp)

    if policy_violation:
        return await errors.error(400, f'The request contains content which violates this model\'s policies for "{policy_violation}".', 'We currently don\'t support any NSFW models.')


    role_cost_multiplier = credits_config['bonuses'].get(user['role'], 1)
    cost = round(cost * role_cost_multiplier)

    if user['credits'] < cost:
        return await errors.error(429, 'Not enough credits.', 'Wait or earn more credits. Learn more on our website or Discord server.')


    # Send the completion request

    if 'chat/completions' in path and not payload.get('stream') is True:
        payload['stream'] = False

    media_type = 'text/event-stream' if payload.get('stream', False) else 'application/json'

    return fastapi.responses.StreamingResponse(
        content=streaming.stream(
            user=user,
            path=path,
            payload=payload,
            credits_cost=cost,
            input_tokens=input_tokens,
            incoming_request=incoming_request,
        ),
        media_type=media_type
    )
Added more documentation 2023-08-12 17:49:31 +02:00			`"""Does quite a few checks and prepares the incoming request for the target endpoint, so it can be streamed"""`
First actually working version I guess (streaming support) 2023-06-30 02:49:56 +02:00
Works for BetterGPT 2023-07-19 23:51:28 +02:00			`import json`
some stuff idfk 2023-08-04 03:30:56 +02:00			`import yaml`
moderation is done yay 2023-08-06 21:42:07 +02:00			`import fastapi`
First actually working version I guess (streaming support) 2023-06-30 02:49:56 +02:00
Fixed trademarks 2023-06-28 15:21:14 +02:00			`from dotenv import load_dotenv`
Added load balancer, MongoDB, improved proxy streaming support, more error messages, entire new world order 2023-08-03 01:46:49 +02:00
some stuff idfk 2023-08-04 03:30:56 +02:00			`import streaming`
moderation is done yay 2023-08-06 21:42:07 +02:00			`import moderation`
Added load balancer, MongoDB, improved proxy streaming support, more error messages, entire new world order 2023-08-03 01:46:49 +02:00
Added auto-rewards and invalid key system 2023-08-07 23:28:24 +02:00			`from db import users`
			`from helpers import tokens, errors`
Fixed trademarks 2023-06-28 15:21:14 +02:00
			`load_dotenv()`

Models endpoint is now free and very quick 2023-08-09 11:43:05 +02:00			`models_list = json.load(open('models.json'))`

some stuff idfk 2023-08-04 03:30:56 +02:00			`with open('config/credits.yml', encoding='utf8') as f:`
			`credits_config = yaml.safe_load(f)`

Added load balancer, MongoDB, improved proxy streaming support, more error messages, entire new world order 2023-08-03 01:46:49 +02:00			`async def handle(incoming_request):`
First actually working version I guess (streaming support) 2023-06-30 02:49:56 +02:00			`"""Transfer a streaming response from the incoming request to the target endpoint"""`
WIP 2023-07-25 02:42:53 +02:00
Added auto-rewards and invalid key system 2023-08-07 23:28:24 +02:00			`path = incoming_request.url.path.replace('v1/v1/', 'v1/')`
First actually working version I guess (streaming support) 2023-06-30 02:49:56 +02:00
some stuff idfk 2023-08-04 03:30:56 +02:00			`# METHOD`
Several improvements, about to change DB 2023-08-01 20:19:00 +02:00			`if incoming_request.method not in ['GET', 'POST', 'PUT', 'DELETE', 'PATCH']:`
some general changes 2023-08-12 17:56:21 +02:00			`return await errors.error(405, f'Method "{incoming_request.method}" is not allowed.', 'Change the request method to the correct one.')`
Works for BetterGPT 2023-07-19 23:51:28 +02:00
some stuff idfk 2023-08-04 03:30:56 +02:00			`# PAYLOAD`
Works for BetterGPT 2023-07-19 23:51:28 +02:00			`try:`
WIP 2023-07-25 02:42:53 +02:00			`payload = await incoming_request.json()`
Works for BetterGPT 2023-07-19 23:51:28 +02:00			`except json.decoder.JSONDecodeError:`
WIP 2023-07-25 02:42:53 +02:00			`payload = {}`

some general changes 2023-08-12 17:56:21 +02:00			`# Tokenise w/ tiktoken`
Several improvements, about to change DB 2023-08-01 20:19:00 +02:00			`try:`
moderation is done yay 2023-08-06 21:42:07 +02:00			`input_tokens = await tokens.count_for_messages(payload['messages'])`
Added load balancer, MongoDB, improved proxy streaming support, more error messages, entire new world order 2023-08-03 01:46:49 +02:00			`except (KeyError, TypeError):`
Several improvements, about to change DB 2023-08-01 20:19:00 +02:00			`input_tokens = 0`

some general changes 2023-08-12 17:56:21 +02:00			`# Check user auth`
some stuff idfk 2023-08-04 03:30:56 +02:00			`received_key = incoming_request.headers.get('Authorization')`
Several improvements, about to change DB 2023-08-01 20:19:00 +02:00
some stuff idfk 2023-08-04 03:30:56 +02:00			`if not received_key:`
some general changes 2023-08-12 17:56:21 +02:00			`return await errors.error(401, 'No NovaAI API key given!', 'Add "Authorization: Bearer nv-..." to your request headers.')`
Several improvements, about to change DB 2023-08-01 20:19:00 +02:00
some stuff idfk 2023-08-04 03:30:56 +02:00			`if received_key.startswith('Bearer '):`
			`received_key = received_key.split('Bearer ')[1]`
Several improvements, about to change DB 2023-08-01 20:19:00 +02:00
some stuff idfk 2023-08-04 03:30:56 +02:00			`user = await users.by_api_key(received_key.strip())`
Several improvements, about to change DB 2023-08-01 20:19:00 +02:00
			`if not user:`
some general changes 2023-08-12 17:56:21 +02:00			`return await errors.error(401, 'Invalid NovaAI API key!', 'Create a new NovaOSS API key.')`
Added load balancer, MongoDB, improved proxy streaming support, more error messages, entire new world order 2023-08-03 01:46:49 +02:00
			`ban_reason = user['status']['ban_reason']`
			`if ban_reason:`
some general changes 2023-08-12 17:56:21 +02:00			`return await errors.error(403, f'Your NovaAI account has been banned. Reason: "{ban_reason}".', 'Contact the staff for an appeal.')`
Added load balancer, MongoDB, improved proxy streaming support, more error messages, entire new world order 2023-08-03 01:46:49 +02:00
			`if not user['status']['active']:`
some general changes 2023-08-12 17:56:21 +02:00			`return await errors.error(418, 'Your NovaAI account is not active (paused).', 'Simply re-activate your account using a Discord command or the web panel.')`
Added load balancer, MongoDB, improved proxy streaming support, more error messages, entire new world order 2023-08-03 01:46:49 +02:00
Models endpoint is now free and very quick 2023-08-09 11:43:05 +02:00			`if '/models' in path:`
			`return fastapi.responses.JSONResponse(content=models_list)`

some general changes 2023-08-12 17:56:21 +02:00			`# Calculate cost of tokens & check for nsfw prompts`
some stuff idfk 2023-08-04 03:30:56 +02:00			`costs = credits_config['costs']`
			`cost = costs['other']`
Added load balancer, MongoDB, improved proxy streaming support, more error messages, entire new world order 2023-08-03 01:46:49 +02:00
Fixed some several issues with moderation, models etc. 2023-08-08 01:04:35 +02:00			`policy_violation = False`
moderation is done yay 2023-08-06 21:42:07 +02:00
some stuff idfk 2023-08-04 03:30:56 +02:00			`if 'chat/completions' in path:`
			`for model_name, model_cost in costs['chat-models'].items():`
			`if model_name in payload['model']:`
			`cost = model_cost`
Added load balancer, MongoDB, improved proxy streaming support, more error messages, entire new world order 2023-08-03 01:46:49 +02:00
Fixed some several issues with moderation, models etc. 2023-08-08 01:04:35 +02:00			`policy_violation = await moderation.is_policy_violated(payload['messages'])`

			`elif '/moderations' in path:`
			`pass`
moderation is done yay 2023-08-06 21:42:07 +02:00
			`else:`
			`inp = payload.get('input', payload.get('prompt'))`

Added auto-rewards and invalid key system 2023-08-07 23:28:24 +02:00			`if inp:`
			`if len(inp) > 2 and not inp.isnumeric():`
Fixed some several issues with moderation, models etc. 2023-08-08 01:04:35 +02:00			`policy_violation = await moderation.is_policy_violated(inp)`
Added auto-rewards and invalid key system 2023-08-07 23:28:24 +02:00
Fixed some several issues with moderation, models etc. 2023-08-08 01:04:35 +02:00			`if policy_violation:`
some general changes 2023-08-12 17:56:21 +02:00			`return await errors.error(400, f'The request contains content which violates this model\'s policies for "{policy_violation}".', 'We currently don\'t support any NSFW models.')`

moderation is done yay 2023-08-06 21:42:07 +02:00
some stuff idfk 2023-08-04 03:30:56 +02:00			`role_cost_multiplier = credits_config['bonuses'].get(user['role'], 1)`
			`cost = round(cost * role_cost_multiplier)`
Added load balancer, MongoDB, improved proxy streaming support, more error messages, entire new world order 2023-08-03 01:46:49 +02:00
			`if user['credits'] < cost:`
some general changes 2023-08-12 17:56:21 +02:00			`return await errors.error(429, 'Not enough credits.', 'Wait or earn more credits. Learn more on our website or Discord server.')`

Several improvements, about to change DB 2023-08-01 20:19:00 +02:00
some general changes 2023-08-12 17:56:21 +02:00			`# Send the completion request`
Fixed trademarks 2023-06-28 15:21:14 +02:00
sleep deprivation caused me to be not productive today 2023-08-06 00:43:36 +02:00			`if 'chat/completions' in path and not payload.get('stream') is True:`
some stuff idfk 2023-08-04 03:30:56 +02:00			`payload['stream'] = False`
Work in progess 2023-08-03 03:50:04 +02:00
moderation is done yay 2023-08-06 21:42:07 +02:00			`media_type = 'text/event-stream' if payload.get('stream', False) else 'application/json'`

			`return fastapi.responses.StreamingResponse(`
some stuff idfk 2023-08-04 03:30:56 +02:00			`content=streaming.stream(`
			`user=user,`
			`path=path,`
			`payload=payload,`
			`credits_cost=cost,`
			`input_tokens=input_tokens,`
			`incoming_request=incoming_request,`
			`),`
moderation is done yay 2023-08-06 21:42:07 +02:00			`media_type=media_type`
some stuff idfk 2023-08-04 03:30:56 +02:00			`)`