WIP

norton120 · 2024-06-16T22:20:32Z

this may be a long-running branch since cutting the tests over to use httpx app + FastAPI dependency injection is gonna be a bit of work.

Preamble

The Database(s) that support the application state, agent memory (including vector lookup) and the application itself (user/org management, permissions, settings config etc) interface with the rest of the codebase via a MetadataStore object.

Goals here

The metadatastore stays as a gateway for now, but all the configuration gets conventionalized to each adapter type. Overrides need to happen in the config stack (so 1. envars 2. config file 3. default (lives in the adapter)). Don't start moving to doing ORM'y stuff here yet, keep this focused on config squashing.
Way more test hooks. We want to start seeing unit tests in this PR, the best way to do that is to add override hooks to the existing classes where they are useful and break these down more.

norton120 · 2024-06-16T22:21:07Z

@yoaquim this is the working PR we were talking about

norton120 · 2024-06-17T12:13:26Z

K - thinking through where the complication that prevents us using the orm directly, it's really only the archive. So if we add accessors on the related objects, the adapter can probably obfuscate that complication.
Something like

current_agent = authed_user.agents.get(agent_id)
# here's the magic
# archive_memory is not necessarily a sqlalchemy model
return current_agent.archive_memory.search(search_value)

In this case the adapter interface duck types as an orm - so with the pgvector adapter archive_memory is just a model, in SQLite it is a chroma wrapper.

norton120 · 2024-06-18T23:41:03Z

@cpacker @sarahwooders do you know if the init.sql file at the top level of the repo is for deployment? creating the initial user/password/db for the docker image would just be setting those envars

I'd like to create the test db in the docker db init, ideally, I'd like to not add a second init file and switch them around, so that's why I'm trying to track down what it is used for at the moment

norton120 · 2024-06-19T14:45:04Z

@cpacker @sarahwooders do you know if the init.sql file at the top level of the repo is for deployment? creating the initial user/password/db for the docker image would just be setting those envars

I'd like to create the test db in the docker db init, ideally, I'd like to not add a second init file and switch them around, so that's why I'm trying to track down what it is used for at the moment

For the moment I dumped into that init, overriding it without disturbing it is a bit of work. Can revisit before we start merging.

norton120 · 2024-06-26T17:40:54Z

OK. So the shortest path I can see from here is:

add alembic migrations
move to migration and connection instead of create_all (because that won't work anymore)
overload the metadatastore methods to get parity - this should expose the chroma conflict naturally
solve for chroma/pgvector as an overloaded model in the ORM
get all tests passing, merge in all upstream changes
delete all the dead code. there will be a lot. there already is.

sarahwooders · 2024-07-06T18:02:01Z

memgpt/orm/agent.py

+    name:Mapped[Optional[str]] = mapped_column(String, nullable=True, doc="a human-readable identifier for an agent, non-unique.")
+    persona: Mapped[str] = mapped_column(doc="the persona text for the agent, current state.")
+    # TODO: reconcile this with persona,human etc AND make this structured via pydantic!
+    # TODO: these are vague and need to be more specific and explained. WTF is state vs _metadata?


state is the in-process state of the agent (e.g. the memory, in-context message IDs) while metadata is arbitrary info the user might want to attach to the agent

is working. Next up: - isolate the test_server failing tests - move the settings mock into a conftest fixture - add a test hook for SyncServer so you can do the same thing there. - propigate.

for default persona, human, and preset. Now all derived from settings (which is in turn derived from envars). Still need to square away with the config file hierarchy, so once we resolve the value there is only one definitive source of truth across the rest of the code.

hit by a bus the next person doesn't need to spend a week getting up to speed. This helps clarify the goal in this PR: one config hierarchy assembled once, with one mega hook.

TODO: - mount the test sqlite/chroma somewhere that doesn't clutter up the repo

…ep things clean

…le stripping out extraneous elements. The memory thing needs to be abstracted in a later time, never clear if these are strings or templates or references to a related object

…to clarify https://www.notion.so/Data-Model-Questions-43ef1336483f49c1bf77daddf3f320fa

…le entrypoint to be good to go

…to be helpers like palm to do migrations and such

…scheme. the settings.backend object is self-contained, so no more external double-setting

… stub everything over to ORM models.

1. the metadata.py file is being updated to use the ORM 2. conflicting models are being sunset and/or quarantined for this PR 3. CRUD accessors stay in metadatastore but are now managed behind the scenes by the ORM This is going to break a lot of things (which is goodTo get unbroken: 1. update the tests to no longer be aware of the backend configs 2. update the code to same 3. remove all the SQLModel and deprecated backend code 4. document (loom) how the ORM works, how to create migrations, how to traverse the ORM tree etc etc. Strategy here should be to merge this into a long-running branch and start CI against it, then keep pulling main into it until we're ready for a major release (this will be a major). Configs will be extremely thin after this PR. We should be set up to move docker dev to a single stack and docker quickstart to a single image.

…sses still a mess, but I think we can get around that to get things working

stop spinning up servers in tests! But... 1. need to move the db_session all the way up to the request (where it belongs). 2. dep inject that thing at request time! 3. dep override it in conftest!

sarahwooders self-requested a review June 17, 2024 04:14

norton120 force-pushed the feature/1437/condense-configs branch from 393f982 to eb50aff Compare June 19, 2024 21:19

norton120 mentioned this pull request Jul 2, 2024

feat: refactor CoreMemory to support generalized memory fields and memory editing functions #1479

Merged

norton120 force-pushed the feature/1437/condense-configs branch from 7cafef7 to c159afc Compare July 3, 2024 15:17

sarahwooders reviewed Jul 6, 2024

View reviewed changes

sarahwooders changed the base branch from main to 1.0.0-pre July 9, 2024 22:40

sarahwooders approved these changes Jul 9, 2024

View reviewed changes

sarahwooders marked this pull request as ready for review July 9, 2024 22:40

norton120 added 17 commits July 9, 2024 18:50

logs red to green

23b6082

logs reflect debug status

ac0572d

import submodule

dc33421

using memgpt logger not global logger

2488380

found the bug duplication

39fe6f2

black

c74109b

isort

ca93678

placeholder while thinking

20bd4cc

removing dead code to make it easier to refactor

57da67a

starting in on abstracting the metadatastore adapters

56368a0

most of the initial config override in test_server

92f136e

is working. Next up: - isolate the test_server failing tests - move the settings mock into a conftest fixture - add a test hook for SyncServer so you can do the same thing there. - propigate.

abstracted fixture

72b88c7

moving more to fixtures

af7bb37

defaults

6e6d310

conflicting persist

094abd0

Started a working README for refactor, so if I get

3e6e3bb

hit by a bus the next person doesn't need to spend a week getting up to speed. This helps clarify the goal in this PR: one config hierarchy assembled once, with one mega hook.

norton120 and others added 27 commits July 9, 2024 19:09

almost have the conftest pattern set up

73c636c

This is the basic 2 backend pattern.

62da646

TODO: - mount the test sqlite/chroma somewhere that doesn't clutter up the repo

conftest respects relationships

2ea713b

sqlite now stores all the test databases in the .persist folder to ke…

d0d18d7

…ep things clean

updating readme

0187d81

more readme

732cbbb

more models, bringing up lots of questions about the data model

347c9f0

I'm trying to keep this as close to the current model as possible whi…

7cae7cf

…le stripping out extraneous elements. The memory thing needs to be abstracted in a later time, never clear if these are strings or templates or references to a related object

basic ORM pattern for most objects

0f3e8c8

presets model started, lots of questions. pushing to this notion doc …

4e22369

…to clarify https://www.notion.so/Data-Model-Questions-43ef1336483f49c1bf77daddf3f320fa

pretty sure this is the current model

4347d39

alembic-managed migrations

90349f0

migrations now included on startup. we need to add it to every possib…

b8391a9

…le entrypoint to be good to go

added jobs model

9f0b8dc

configs pattern for now. these should be 1st class. also there needs …

d11f87b

…to be helpers like palm to do migrations and such

finally time to start cutting

c4e45b0

pg_uri is now the _only_ db setting. it will always have the correct …

1c2ae94

…scheme. the settings.backend object is self-contained, so no more external double-setting

chewing through all the redundant crud methods in metadata to ideally…

2c3ca0d

… stub everything over to ORM models.

hacking out SQLModel

7f9a7a3

last of the sqlmodel models

585a3cb

stripped all the SQLModel, pydantic schemas vs dataclasses vs arb cla…

ec0b328

…sses still a mess, but I think we can get around that to get things working

cleanup

ed09718

migrations now all the way up on both sqlite and pg

fe38a1f

minimal orm test passes

3fb3931

Conftest ALMOST has a scoped test app so we can

37a04a2

stop spinning up servers in tests! But... 1. need to move the db_session all the way up to the request (where it belongs). 2. dep inject that thing at request time! 3. dep override it in conftest!

breakpoint to rebase

7f4711e

norton120 force-pushed the feature/1437/condense-configs branch from 94ce074 to 7f4711e Compare July 9, 2024 23:11

sarahwooders merged commit efd4b8b into letta-ai:1.0.0-pre Jul 10, 2024
1 of 5 checks passed

norton120 deleted the feature/1437/condense-configs branch July 15, 2024 11:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP - Condense configurations into conventions for Database (Metadatastore) Adapters #1460

WIP - Condense configurations into conventions for Database (Metadatastore) Adapters #1460

norton120 commented Jun 16, 2024

norton120 commented Jun 16, 2024

norton120 commented Jun 17, 2024

norton120 commented Jun 18, 2024

norton120 commented Jun 19, 2024

norton120 commented Jun 26, 2024

sarahwooders Jul 6, 2024

WIP - Condense configurations into conventions for Database (Metadatastore) Adapters #1460

WIP - Condense configurations into conventions for Database (Metadatastore) Adapters #1460

Conversation

norton120 commented Jun 16, 2024

WIP

Preamble

Goals here

norton120 commented Jun 16, 2024

norton120 commented Jun 17, 2024

norton120 commented Jun 18, 2024

norton120 commented Jun 19, 2024

norton120 commented Jun 26, 2024

sarahwooders Jul 6, 2024

Choose a reason for hiding this comment