Move away from sqlite #307

tobias · 2015-03-17T18:44:20Z

This is to track moving away from sqlite to something we can access
concurrently. We've had quite a few bugs related to concurrency issues
with sqlite (#266, #115, #105), and we can't do automatic promotion
with it. It's also possible to lock the database from outside of the
app when doing maintenance (as touched on in #226).

@technomancy and @xeqi have done quite a bit of work on using a
log-based event stream system. The event logging is already happening
in production, and there is work on the less-sql branch to use the log
instead of sqlite for queries. You can read about that work on
the mailing list. If we want to continue with the event log approach,
at least the following things need to happen:

the less-sql branch needs to be brought up to date
command-line maintenance needs to write entries to the log - the
current shell scripts only alter the sqlite db, so no deletion
events get written to the log. It may be worthwhile to have lein
aliases to write these deletion events to the log.
if we do have processes other than the app itself writing to the
logs, we'll need to move to file-based locking instead of using
clojure's locking macro to prevent log corruption
we'll also need some way to tell the app to reload the logs if other
processes can write to them (a file watcher perhaps?)

We don't currently have the backup instance running, but if we do plan
to bring that back, we need to take that in to consideration (file
locking may be enough here if both instances share the same log file
set.

An alternative approach would be to move to a different database that
better handles concurrency (postgres, etc). That delegates locking and
corruption issues to the db, but adds infrastructure complexity.

tobias · 2015-03-23T13:05:12Z

I've rebased the work from the less-sql branch, and updated it to the point where all of the tests are at least passing. I've pushed that work to the event-log branch. It wasn't a clean rebase, given that the less-sql branch was a couple of years old, and lot has changed since then. So even though the tests are passing, my current confidence in this branch is low.

tobias · 2015-04-20T01:13:07Z

Other options:

Move to h2 or some other embedded java db that supports concurrency. This may be doable with minimal changes, but we would still need some way make admin updates. I believe h2 has a CLI client that can connect to the embedded instance.
Serialize writes to sqlite via an agent, queue, or similar. We would need a way to wait for and receive the result of the operation.

This forces all updates/inserts to the db to go through a single-threaded executor, which prevents concurrent db access and the resulting lock exceptions. This is hopefully a stopgap measure until we get rid of sqlite entirely.

tobias · 2015-04-29T15:38:36Z

That commit serializes db writes through a single thread, and 7fa5980 moves the admin script functionality to the clojars.admin ns so we won't collide with the app's db writes when doing admin tasks. Note that serializing the writes is a stop-gap until we move away from sqlite completely.

thiagofm · 2015-07-27T09:21:55Z

Why can't we use just a normal RDBMS as postgres? I find that more straightforward and extensible.

tobias · 2015-07-30T22:29:46Z

Postgres would be nice, but there are costs associated with moving to it:

it's an additional process to monitor on the server
dev environments would need a postgres running
backups may be a little trickier

The serialized-writes solution that we have currently is a bit gross,
but is working currently, and solves all of the issues we had with
talking directly to sqlite, so I'm hesitant to bring in a more complex
solution right now.

…ojars#307] This forces all updates/inserts to the db to go through a single-threaded executor, which prevents concurrent db access and the resulting lock exceptions. This is hopefully a stopgap measure until we get rid of sqlite entirely.

kirillsalykin · 2015-11-18T16:44:03Z

@tobias what do you think about using datomic free (with h2 as store)?

tobias · 2015-11-21T20:49:48Z

Thanks for the suggestion, but our data isn't something that we need to maintain history of, generally, and I don't want to add the complexity of running a transactor.

jkutner · 2015-12-17T18:23:55Z

PR: #443

thestinger · 2016-02-06T04:37:22Z

You could at least turn on SQLite's WAL mode so readers aren't blocked by writers and vice versa: https://www.sqlite.org/wal.html. The timeout handling is configurable: https://sqlite.org/faq.html#q5. SQLite isn't going to be fast with more than one concurrent writer but there's no reason to be hacking around it: it does work fine, just perhaps not how you want it to by default.

tobias · 2016-02-25T00:05:06Z

@thestinger - we'd happily take a pull request that enables WAL via the java SQLite driver.

danielcompton · 2016-07-30T00:13:33Z

Just a followup on this, I think there were (at least!) two more reasons why we were getting concurrency errors:

1: jdbc-sqlite was compiled without HAVE_USLEEP up until xerial/sqlite-jdbc@4b2b2c3 (xerial/sqlite-jdbc#104).

Running

(require '[clojure.java.jdbc :as j])

(dorun (pmap #(time (let [conn (get-connection)]
                      (dotimes [n 20]
                        (j/with-db-transaction [conn conn]
                                               (j/insert! conn :test {:rand (* % n)}))))) (range 5)))

I get

"Elapsed time: 17.946152 msecs"
"Elapsed time: 1019.802661 msecs"
"Elapsed time: 2022.392828 msecs"
"Elapsed time: 3027.563515 msecs"
SQLException [SQLITE_BUSY]  The database file is locked (database is locked)  org.sqlite.core.DB.newSQLException (DB.java:890)

This looks like the same issue as http://beets.io/blog/sqlite-nightmare.html, where the sqlite driver is sleeping for 1 second at a time before retrying, meaning that concurrent writes could stack up, especially combined with:

2: There is also xerial/sqlite-jdbc#59 which would allow us to use sqlite with two different threading options: multi-threaded, or single threaded. Both of these would probably be faster than the default serialised option.

There hasn't been a release of jdbc-sqlite for a while, but we should look at upgrading and taking advantage of these features when there is.

tobias · 2019-12-31T14:20:21Z

We've moved to postgres (see #736), which has been released as part of Clojars 79.

tobias added the security label Mar 17, 2015

tobias added ready-for-work size:large labels Apr 14, 2015

tobias self-assigned this Apr 27, 2015

tobias mentioned this issue Nov 15, 2015

SQLException: [SQLITE_BUSY] The database file is locked #266

Closed

tobias mentioned this issue Dec 29, 2019

Use postgres instead of sqlite #736

Merged

tobias closed this as completed Dec 31, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move away from sqlite #307

Move away from sqlite #307

tobias commented Mar 17, 2015

tobias commented Mar 23, 2015

tobias commented Apr 20, 2015

tobias commented Apr 29, 2015

thiagofm commented Jul 27, 2015

tobias commented Jul 30, 2015

kirillsalykin commented Nov 18, 2015

tobias commented Nov 21, 2015

jkutner commented Dec 17, 2015

thestinger commented Feb 6, 2016

tobias commented Feb 25, 2016

danielcompton commented Jul 30, 2016 •

edited

Loading

tobias commented Dec 31, 2019

Move away from sqlite #307

Move away from sqlite #307

Comments

tobias commented Mar 17, 2015

tobias commented Mar 23, 2015

tobias commented Apr 20, 2015

tobias commented Apr 29, 2015

thiagofm commented Jul 27, 2015

tobias commented Jul 30, 2015

kirillsalykin commented Nov 18, 2015

tobias commented Nov 21, 2015

jkutner commented Dec 17, 2015

thestinger commented Feb 6, 2016

tobias commented Feb 25, 2016

danielcompton commented Jul 30, 2016 • edited Loading

tobias commented Dec 31, 2019

danielcompton commented Jul 30, 2016 •

edited

Loading