From 3af50b28478a631d412584e272d3dbb55fe24c2d Mon Sep 17 00:00:00 2001 From: thuypham03 Date: Sun, 14 Apr 2024 22:49:36 -0400 Subject: [PATCH] add sp24 lec7 --- versioned_docs/version-2024sp/lecture7.md | 257 ++++++++++++++++++ .../version-2024sp-sidebars.json | 4 + 2 files changed, 261 insertions(+) create mode 100644 versioned_docs/version-2024sp/lecture7.md diff --git a/versioned_docs/version-2024sp/lecture7.md b/versioned_docs/version-2024sp/lecture7.md new file mode 100644 index 000000000..715188d59 --- /dev/null +++ b/versioned_docs/version-2024sp/lecture7.md @@ -0,0 +1,257 @@ +--- +id: lecture7 +title: Lecture 7 +--- + +[Lecture Slides](https://docs.google.com/presentation/d/1ODV1BWR333kGsO9OUSpprrQt3Fgp0sw85z9Kr4_OuIU/edit?usp=sharing) + +## Intro to Databases and Firebase + +A lot of the apps we have been making work and maintain states throughout the +lifetime of the app, but lack one critical feature - if we restart the app +or refresh the page, all of our data disappears! We need +some kind of persistent storage for this data, which is where **databases** get involved. + +### Why do we need a database for our backend? + +- Data stored within Node.js is per instance +- Most applications require persistence +- Data analysis +- Performant data location +- Offloading unneeded data from memory + +### Types of Databases + +| SQL | NoSQL | +| ----------------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------ | +| Stores data in tables, utilizing rows and tables | Stores data in documents | +| Has predefined schema, which can be relational (tables can provide access to related data points in other tables) | Can abide to any structure you want (no schema or partial schema) | +| Less expensive for smaller datasets and can execute complex queries | Better for large sets of data, but not complex queries | +| Less flexible | More flexible (can be used with most popular languages) | +| Leans towards strong consistency | Favors eventual consistency (here may be some delay in getting response back) | +| Vertically scalable (need more computing power on one machine to store more data) | Horizontally scalable (can distribute storage and compute power on multiple machines | +| Examples: MySQL, PostgreSQL | Examples: **Firebase**, MongoDB | + +### What is Firebase? + +- Firebase is a Backend as a Service (BaaS) offered by Google + - Allows you to store data + - Host websites + - Authentication +- NoSQL DB + - Not only SQL + - Non relational + +#### Why Firebase? + +- Real-time operations +- Firebase Authentication +- Built-in analytics +- Also supports hosting/deployment +- Integration with other Google services +- Structure we’re familiar with! + +## Getting Started With Firestore + +If you're having trouble setting up your project, feel free to refer to this video +of me setting one up from [scratch](https://drive.google.com/file/d/18qj5XFPXgZhixGbyBBLMIYkuiyiW-HWs/view?usp=sharing). +If you still have questions, feel free to post on Ed or come to Office Hours! +Let's finally start talking about how we can perform operations on our database +using Typescript. + +### Firestore Data Model + +Firstly, a Firestore database is NOSQL, document-model database generally comprised of multiple collections, +which may house differing data. To take a simplified example, a Cornell database +could have some of the following collections: `people`, `courses`, `locations`. +Certain collections may be broken up into multiple collections, such as breaking +up `people` into `staff` and `students` or even into subcollections, which we will +not discuss but they exist for those who are interested, as we could simply distinguish +different people with a field, such as `role: Student | Staff`. The id of a collection, +which is what is used to access it, is generally a descriptive name of the collection. + +A typical model, including that of Firestore, has collections comprised of documents, +or docs for short, which would be the "items" we want to store. Going back to our +Cornell example, our `people` collection would probably have documents pertaining +to students or staff at Cornell, with each document being uniquely identified +by a unique id, like netid. Within each document, we may have fields like `age` +and `address`, so in this way, documents can be thought of as Objects. But as you +may have already noticed, some documents would have fields that others don't - students +may have a `major` field while staff may have a `salary` field. As it turns out, that's +totally ok! Generally speaking, having more uniform document fields across a collection +gives stronger guarantees about each document and is often a more natural fit, but Firestore +does not require us to have uniform fields within all the documents of a collection, which gives +us a layer of flexibility. + +Using the following code, we can create references to both the `people` collection and +specific docs such as the `myl39` doc (which represents me!). + +```typescript +const peopleCollectionRef = db.collection('people'); +const michelleDocRef = peopleCollectionRef.doc('myl39'); +``` + +### Firestore Operations + +We generally refer to database operations as `CRUD`, which stands for: + +- **C**reate/Insert - Create a document (will fail if the document exists) +- **R**ead/Find/Query - To search for documents based on their properties +- **U**pdate - Update an existing document (will fail otherwise) +- **D**elete - Delete an existing document + +### Create + +To create a document in Firestore, we primarily use the `set` function. +Here's an example of setting a document in our `people` database. + +```typescript +await peopleCollectionRef.set(michelleDocRef, { + year: '2023', + age: 20, +}); +``` + +`set` will write to the document `myl39` within the `people` collection, creating it +if it does not yet exist. + +Note that ids are autogenerated if the name of the document is not specified in the `doc()` call with the `set` function. + +### Read + +To read data in Firestore, we first need to take a snapshot of the data we want to read. Then, +we check if that snapshot of the document we want actually exists before trying to extract its data. + +Here's a simple example logging my data. Notice I log `doc.data()` and not `doc` because +the former is where the data actually resides. + +```typescript +const doc = await peopleCollectionRef.doc(netid).get(); + +if (doc.exists) { + console.log('Document data:', doc.data()); +} else { + console.log('No such document!'); +} +``` + +We can also perform querying on collections, where we filter the database with certain +criteria. Here's a simple example of looking for all people in `people` who are 20. + +```typescript +const snapshot = await peopleCollectionRef +.where(“age”, “==”, 20) +.get(); + +snapshot .forEach((doc) => { + console.log(doc.id, ' => ', doc.data()); +}); +``` + +Line by line, we first take a snapshot of all such documents in the query specified, and then, we perform the same `doc.data()` approach. We do not need to check if each doc exists because we are using a for loop, and if a doc does not exist, it simply would not be part of the for loop. + +We can also order our search results. Here is an example of querying for people who are at least 20 and then ordering them in descending order. + +```typescript +const snapshot = await peopleCollectionRef +.where(“age”, “>=”, 20) +.orderBy(“age”, “desc”) +.limit(3).get(); + +querySnapshot.forEach((doc) => { + console.log(doc.id, ' => ', doc.data()); +}); +``` + +### Update + +Updating a document will only replace the specified fields within a doc and maintain +unmodified fields. So the following code keep the `last` field but change `age` and +`first`. + +```typescript +await peopleCollectionRef +.doc(‘myl39’) +.update({ age: '20', first: 'michelle' }); +``` + +Note if the document we're trying to access is not available, an error will occur. + +### Delete + +Deleting a document removes it from the collection. + +```typescript +await peopleCollectionRef +.doc(‘myl39’) +.delete(); +``` + +## Callback/Promise-based vs Real-Time Queries + +| Promise-Based | Real-Time | +| ------------------------------------------------------------------------------------------------------------------------------------------------------ | -------------------------------------------------------------------------------------------------------------------------------------------------------------- | +| If you need the data now, you can query for it | You already have the data | +| Data queries can be decentralized (done in any component) | Data queries are fetched and memoized through centralized (React) hooks | +| Querying data is imperative, but can quickly become hard to maintain and track (and you lose some of the advantages of a declarative web UI framework) | Up-front cost to query data pays off (because you don't _hopefully_ have to query it again) | +| There is no cleanup code | You first have to "subscribe" to changes in the data, then unsubscribe after you are done (kind of like opening and closing a file stream when reading a file) | + +### What do Callback/Promise-based vs. Real-Time Queries Look Like? + +**Promise-based queries** are single queries that return a single async result. So, they are run once and then passed along downstream to children and other descendants of your component. Typically, they are used to react to some update (i.e user clicks a button, a component loads). + +**Real-time queries** are single queries that return a stream of async results such as weather data. These types of queries are used once the data is listenable and needs to be "subscribed to". These take a stream of results and are built on top of wbe sockets, which are abstractions over a byte stream. So, they're good for ... real-time applications. + +### How Do Callback/Promised Based vs. Real-Time Queries Work? + +**Promise based queries** typically calls some backend API route, which fetches and returns data to you. They're built on top of traditional HTTP requests. + +**Real-time queries** might call a backend route to pass data over to a web socket or it'll simply use an API library to makes calls directly to a database (ex. Firebase Firestore call). These queries are usually wrapped in a library like [RxJS](https://rxjs.dev/)'s observable data type or function calls that allow you to subsribe to changes. + +### Choosing a Querying Method + +As described in the first section, the type of queries your application will use will affect the app's architecture. +In particular, real-time queries play nicely with having a centralized query that runs over a listenable data access object that is "owned" either by + +1. a top-level component (OK in small apps, but prone to prop drilling in more complex apps), or +2. a custom React hook that wraps an effect (triggering an update when the data access object publishes a new version of the data) + +That is not to say that your app cannot use both types of queries. It is just that a real-time application requires a specific architecture in which all data is queried first and passed along to components as props or referenced by components via (potentially custom) React/Redux hooks. This does not play nicely with callback/Promise-based queries because the data from the callback/Promise-based queries may be in an inconsistent state by the time the data from a real-time query has updated. + +### Firebase Firestore Application: Callback/Promise-based or Real-Time Queries + +Firestore offers you a database that nicely organizes your data into _documents_ and _collections_ (groups of documents). It allows you to build queries that can either + +1. return _once_ with a single snapshot of data (a Promise-based query), or +2. allow you to hook into the data's live values (a real-time query). + +#### Firestore Real-time Queries + +Provides collection + document data as an listenable (subscribable) data object + +- As soon as a collection updates, the collection access object publishes a new version of the collection +- As soon as a doc updates, the doc access object publishes a new version of the doc + This can be passed as a React prop or an effect dependency, which triggers a component update! + +#### Anatomy of a Firebase Firestore Real-Time Application (The "Full" Stack) + +![Anatomy of a Firebase Firestore Real-Time Update](/img/lec8/firestore-real-time.png) + +Unlike callback/promise-based queries, the connection between updating and fetching data is completely gone. Updating data occurs along an entirely separate channel from subscribing to the data. This means that implementing calls to update data will look very different + +#### Miscellaneous Advice + +When designing a system: + +- avoid two-way dependencies (or as many dependencies as possible) + - as with React & declarative web frameworks, one-way data binding is the way to go + - avoids: more things to update + - avoids: more surface area for synchronization errors + +This philosophy helps us prefer real-time queries over Promise-based queries, because there is only a single dependency for the queried data, rather than the set of all the decentralized Promise-based queries. + +## Sample code + +This week's sample code starter can be found in the files under [this directory](https://github.com/cornell-dti/trends-fp23-lec7-demo). + +[demo solution](https://github.com/michelleli01/trends-lec7-demo-soln) diff --git a/versioned_sidebars/version-2024sp-sidebars.json b/versioned_sidebars/version-2024sp-sidebars.json index 2a2ce1694..2ef01be5b 100644 --- a/versioned_sidebars/version-2024sp-sidebars.json +++ b/versioned_sidebars/version-2024sp-sidebars.json @@ -30,6 +30,10 @@ { "type": "doc", "id": "version-2024sp/lecture6" + }, + { + "type": "doc", + "id": "version-2024sp/lecture7" } ], "Assignments": [