
epic: Basic Automated test for Installation & Inference on compatible Hardware & OS #1147

Closed · 1 task

dan-menlo opened this issue Sep 8, 2024 · 4 comments

Labels: category: tests (QA automations, tests) · P1: important (Important feature / fix)

dan-menlo (Contributor) commented Sep 8, 2024

Goal

  • We should test the full e2e lifecycle of Cortex's main use case
  • v0.1 covers only llama.cpp

Existing Work

Existing Bugs

Test Cases

Test Harness

Installation & Uninstallation

Starting

Model Running

  • Successful load of cached model (e.g. tinyllama)
  • Successful inference request of cached model
  • Error messages are tested
  • Successful unloading of model
  • Ensure the dylib issue is covered (bug: libengine.dylib not found #953)

Stopping

  • Successfully stops with no dangling processes

Uninstallation

  • Successfully uninstalls with no dangling files
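The stopping and uninstallation checks above boil down to scanning for leftovers. A minimal sketch, assuming hypothetical leftover locations — the real paths depend on platform and installer:

```python
from pathlib import Path

# Hypothetical locations Cortex might leave behind; the actual paths
# depend on the installer and the platform.
CANDIDATE_LEFTOVERS = [
    "~/cortex",
    "~/.cortexrc",
]

def find_dangling_files(paths=CANDIDATE_LEFTOVERS):
    """Return the subset of candidate paths that still exist on disk."""
    return [p for p in paths if Path(p).expanduser().exists()]

def assert_clean_uninstall(paths=CANDIDATE_LEFTOVERS):
    """Fail if any candidate leftover survived the uninstall."""
    leftovers = find_dangling_files(paths)
    assert not leftovers, f"dangling files after uninstall: {leftovers}"
```

The same scan, pointed at a process listing instead of the filesystem, would cover the "no dangling processes" check.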
@dan-menlo dan-menlo added this to Menlo Sep 8, 2024
@dan-menlo dan-menlo converted this from a draft issue Sep 8, 2024
@dan-menlo dan-menlo moved this to In Progress in Menlo Sep 8, 2024
@dan-menlo dan-menlo added the category: tests QA automations, tests label Sep 8, 2024
@dan-menlo dan-menlo changed the title epic: Cortex MVP of automated testing v0.1 epic: Cortex MVP of automated QA Testing v0.1 Sep 8, 2024
@dan-menlo dan-menlo removed their assignment Sep 8, 2024
@dan-menlo dan-menlo changed the title epic: Cortex MVP of automated QA Testing v0.1 epic: Basic Automated test for Installation & Inference on supported Hardware & OS Sep 8, 2024
@dan-menlo dan-menlo changed the title epic: Basic Automated test for Installation & Inference on supported Hardware & OS epic: Basic Automated test for Installation & Inference on compatible Hardware & OS Sep 8, 2024
@freelerobot freelerobot added the P1: important Important feature / fix label Sep 9, 2024
namchuai (Collaborator) commented:

For end-to-end testing, I'm proposing we use pytest.

Advantages

  • Makes it easy to test both our CLI and the API server
  • Tests are easy to write, and can even be generated with AI assistance

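As a sketch of what this could look like: plain test_-prefixed functions that pytest can collect, exercising the CLI via subprocess. The cortex binary name and the -v flag come from the checklist in this thread; everything else here is an assumption:

```python
import re
import shutil
import subprocess

# Resolved path to the installed CLI; None when cortex is not on PATH.
CORTEX = shutil.which("cortex")

def parse_version(output: str):
    """Extract a semver-style version (e.g. '1.0.2') from CLI output."""
    m = re.search(r"\bv?(\d+\.\d+\.\d+)\b", output)
    return m.group(1) if m else None

def test_version_flag_reports_semver():
    """pytest collects this by its test_ name; no pytest APIs needed."""
    if CORTEX is None:
        return  # treat a missing binary as a skip in this sketch
    result = subprocess.run(
        [CORTEX, "-v"], capture_output=True, text=True, timeout=30
    )
    assert result.returncode == 0
    assert parse_version(result.stdout + result.stderr) is not None
```

Keeping the output parsing in a pure helper like parse_version means most of the harness can be unit-tested without a Cortex install at all.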

@freelerobot freelerobot assigned freelerobot and unassigned namchuai Sep 26, 2024
@freelerobot freelerobot moved this from QA to Planning in Menlo Sep 26, 2024
@freelerobot freelerobot added this to the v0.1.1 milestone Sep 26, 2024
@dan-menlo dan-menlo moved this from Planning to Scheduled in Menlo Sep 29, 2024
freelerobot (Contributor) commented Sep 30, 2024

Current Manual QA Checklist

It is a combination of:

Across the following:

  • Windows 11 (online & offline)
  • Ubuntu 24, 22 (online & offline)
  • macOS 14/15 on Apple Silicon (online & offline)
  • macOS on Intel (online & offline)

I'm documenting this for @gabrielle-ong to ensure no tests fall through the cracks, and so that we eventually build coverage across the various kinds of tests.

Installation/Uninstallation

  • it should install with network installer
  • it should install with local installer
  • it should uninstall with & without purging the data folder
  • it should gracefully uninstall when server is still running
  • it should reinstall without having conflict issues with existing cortex data folders
  • it should install with correct folder permissions
  • it should create the expected folders on install: /engines, /models, /logs, plus .cortexrc and the dbs
  • cortex update should update from ~3-5 versions ago to latest (+3 to 5 bump)
  • cortex update should update from the previous version to latest (+1 bump)
  • cortex update should update from previous stable version to latest (stable checking)
  • it should gracefully update when server is actively running
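The update matrix above (+1 bump, +3 to 5 bump, previous stable) could be parameterised by a small helper that picks the source versions to upgrade from out of the release history. A sketch with hypothetical version tags:

```python
def parse_semver(tag):
    """'v1.0.2' or '1.0.2' -> (1, 0, 2), for sorting release tags."""
    return tuple(int(part) for part in tag.lstrip("v").split("."))

def pick_update_sources(releases, bumps=(1, 3, 5)):
    """Given all release tags, pick the version N positions behind latest
    for each N in `bumps` — the versions an update test starts from."""
    ordered = sorted(releases, key=parse_semver)
    return {n: ordered[-1 - n] for n in bumps if n < len(ordered)}
```

A test run would then install each selected version, invoke cortex update, and assert the resulting version equals the latest tag.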

Basic Commands

  • cortex returns helpful text in a timely* way
  • cortex -v should output the current version and check for updates
  • it should correctly log to cortex-cli

Hardware Detection [WIP]

  • TODO

Server

  • it should start server
  • it should stop server
  • it should correctly log to cortex logs
  • it should return server status via ps
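Server start/stop checks usually reduce to polling a health endpoint until the server answers or a timeout expires. A standard-library-only sketch; the default port and the /healthz path are assumptions, not confirmed Cortex behaviour:

```python
import time
import urllib.error
import urllib.request

def health_url(host="127.0.0.1", port=39281, path="/healthz"):
    # Port and path are placeholders; use whatever the server exposes.
    return f"http://{host}:{port}{path}"

def wait_for_server(url, timeout=15.0, interval=0.5):
    """Poll `url` until it returns HTTP 200 or `timeout` seconds elapse."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            with urllib.request.urlopen(url, timeout=interval) as resp:
                if resp.status == 200:
                    return True
        except (urllib.error.URLError, OSError):
            pass  # server not up yet; keep polling
        time.sleep(interval)
    return False
```

The stop test is the mirror image: after shutdown, the same poll must keep failing, which also covers the "no dangling processes" expectation.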

Engines

  • it should CRUD engines
  • it should gracefully resume engine installation if interrupted halfway
  • it should update engines
  • it should gracefully handle when users try to CRUD incompatible engines
  • it should run gguf models on llamacpp
  • it should run trtllm models on trt-llm

Model Management

  • it should pull by built-in model_id
  • it should pull by built-in model_id:variant
  • it should pull by HF repo/model ID
  • it should pull by partial HF url
  • it should pull by full HF url (ending in .gguf)
  • it should resume pull after interruption
  • it should CRUD downloaded models
  • it should correctly update state in /models
  • it should import models

Model Running

  • run should download missing models
  • run works on already downloaded models
  • run should autostart server
  • chat works

With Hardware Acceleration

  • it should auto offload max ngl
  • it should correctly detect available GPUs
  • it should gracefully detect missing dependencies/drivers
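GPU detection can be sketched by shelling out to nvidia-smi's CSV query mode and degrading gracefully when the tool is absent — which doubles as the "missing dependencies/drivers" check. AMD/Metal detection would need separate probes:

```python
import shutil
import subprocess

def parse_gpu_list(csv_output):
    """Parse `nvidia-smi --query-gpu=name,memory.total --format=csv,noheader`
    output into (name, memory) tuples."""
    gpus = []
    for line in csv_output.strip().splitlines():
        name, _, mem = line.partition(",")
        gpus.append((name.strip(), mem.strip()))
    return gpus

def detect_gpus():
    """Return detected NVIDIA GPUs, or [] when the driver/tool is missing."""
    if shutil.which("nvidia-smi") is None:
        return []  # graceful: no driver is an expected condition, not an error
    out = subprocess.run(
        ["nvidia-smi", "--query-gpu=name,memory.total", "--format=csv,noheader"],
        capture_output=True, text=True, timeout=10,
    )
    return parse_gpu_list(out.stdout) if out.returncode == 0 else []
```

An "auto offload max ngl" test could then assert that the layer count chosen by the engine scales with the memory figure reported here.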

@dan-menlo dan-menlo removed this from the v0.1.1 milestone Oct 3, 2024
@dan-menlo dan-menlo added this to the v1.0.0 milestone Oct 3, 2024
@gabrielle-ong gabrielle-ong modified the milestones: v1.0.0, v1.0.2 Oct 14, 2024
gabrielle-ong (Contributor) commented:

Updated for v1.0.1 (Manual QA & API tests) in #1535 to capture changes, e.g.:

  • model management syntax
  • recommend default model
  • llama.cpp installed by default
  • API tests

Closing this issue in favour of iterating the QA list with each update.

@github-project-automation github-project-automation bot moved this from Scheduled to Review + QA in Menlo Oct 22, 2024
@gabrielle-ong gabrielle-ong moved this from Review + QA to Completed in Menlo Oct 22, 2024
dan-menlo (Contributor, Author) commented:
@gabrielle-ong Even though we are choosing to go with a manual test for now, we should create an open ticket for an automated test and put it in the Icebox.

  • We should be careful of "QA debt" and not let it grow too much.
