alexandre-touret · alexandre-touret · Mar 25, 2024 · Mar 11, 2024 · Mar 11, 2024 · Mar 12, 2024
diff --git a/content/posts/jpa_spring_data_query_optimisations.en.md b/content/posts/jpa_spring_data_query_optimisations.en.md
@@ -0,0 +1,213 @@
+---
+title: "Tips & tricks for optimizing JPA queries"
+date: 2024-03-24T06:00:43+01:00
+draft: true
+
+featuredImagePreview: "/assets/images/2024/03/tobias-fischer-PkbZahEG2Ng-unsplash.webp"
+featuredImage: "/assets/images/2024/03/tobias-fischer-PkbZahEG2Ng-unsplash.webp"
+images: ["/assets/images/2024/03/tobias-fischer-PkbZahEG2Ng-unsplash.webp"]
+tags:
+  - Java
+  - JPA
+  - Spring_Data
+
+---
+
+{{< style "text-align:center;" >}}
+_Picture of [Tobias Fischer](https://unsplash.com/fr/@tofi?utm_content=creditCopyText&utm_medium=referral&utm_source=unsplash)_
+
+{{< /style >}}
+
+Most Java developers I know don't really know what happens under the hood of [Spring Data](https://spring.io/projects/spring-data) and the [Java Persistence API (JPA)](https://docs.oracle.com/javaee/7/tutorial/persistence-intro.htm).
+In my opinion, it's mainly due to all the features provided by these specifications and frameworks.
+They bring so much simplicity that you can forget the SQL syntax altogether.
+
+Unfortunately, as your stored data grows, querying your database can become difficult.
+The queries generated on your behalf can make your Java applications slow and potentially break your SLOs.
+
+In this article, I have written down a bunch of tips & tricks to tackle this issue.
+Even if some are related to [Spring Data](https://spring.io/projects/spring-data), I think most of them also apply if you use plain JPA.
+
+You will see that even if JPA looks easy at first glance, it can bring a lot of complexity.
+
+## Observe your application
+### Observe your persistence layer
+
+First and foremost, you **MUST** trace and monitor your persistence layer.
+
+If you use Hibernate (without Spring, Quarkus,...), you can get insights by configuring the logger:
+
+```xml
+<logger name="org.hibernate.SQL">
+     <level value="debug"/>
+</logger>
+```
+If you use Spring (and JPA, Hibernate), you can also get them by adding these configuration properties:
+
+```ini
+spring.jpa.show-sql=true
+spring.jpa.properties.hibernate.format_sql=true
+```
+After getting all the queries and operations done by your persistence layer, you will be able to pinpoint which component is responsible for slowing down your queries: the database or your ORM. 
+
+For huge SQL queries, I usually run them directly against the database with its own tools to check whether I get the same behaviour.
+
+### Observe your database
+We often forget that database platforms provide valuable tools to analyse your queries.
+Once you have pinpointed the time- or resource-consuming queries, you must check why the database is slow on them, for instance because a query does a full scan of your table.
+
+To do that, you can check the execution plan of your SQL queries.
+
+If you use [PostgreSQL (what else)](https://www.postgresql.org/), you can get these insights using the [``EXPLAIN``](https://www.postgresql.org/docs/current/sql-explain.html) command. 
+
+## Check your relations
+Let's go back to our Java application.
+One of the main points of interest of any JPA (and SQL) query is how your entity is joined with others.
+Every join brings costs and complexity.
+
+For JPA queries, you must first check whether each relation between two entities should be [``EAGER`` or ``LAZY``](https://docs.oracle.com/javaee/7/api/javax/persistence/FetchType.html).
+
+As you have probably understood, there is no free lunch.
+You must first measure the time spent in the JPA queries and in the mapping, then check which solution is the best.
+
+By default, ``@ManyToOne`` and ``@OneToOne`` relations are EAGER, while ``@OneToMany`` and ``@ManyToMany`` relations are LAZY.
+Most of the time, I keep using the default configuration. 
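+
+When measurements show that a default does not fit, the fetch type can be overridden on the relation itself. A minimal sketch, assuming the ``jakarta.persistence`` package (``javax.persistence`` on older stacks) and a ``Long`` identifier, neither of which is stated in this article:
+
+```java
+import jakarta.persistence.Entity;
+import jakarta.persistence.FetchType;
+import jakarta.persistence.Id;
+import jakarta.persistence.ManyToOne;
+
+@Entity
+public class Book {
+
+    @Id
+    private Long id; // assumption: the identifier type is not shown in the article
+
+    // @ManyToOne is EAGER by default; declaring it LAZY defers loading the
+    // Store until it is first accessed within an open persistence context.
+    @ManyToOne(fetch = FetchType.LAZY)
+    private Store store;
+
+    public Store getStore() {
+        return store;
+    }
+}
+```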
+
+However, you must take care of the whole [entity graph](https://docs.oracle.com/javaee/7/tutorial/persistence-entitygraphs001.htm) loaded by your query.
+Does the entity loaded through a ``@OneToOne`` relation also load a ``@OneToMany`` relation in an ``EAGER`` way?
+
+It's the kind of question you will have to answer.
+
+### The famous N+1 issue
+
+In this example, we will look into a ``1-N`` relation:
+
+```java
+@Entity
+public class Store{
+[...]
+    @OneToMany(fetch = FetchType.EAGER,mappedBy = "store")
+    private List<Book> books;
+[...]
+```
+
+```java
+@Entity
+public class Book {
+[...]
+    @ManyToOne(targetEntity = Store.class)
+    private Store store;
+[...]    
+```
+
+As declared in the mapping above, this relation is fetched in an EAGER way.
+So, when I try to get all the stores using a ``findAll()`` method:
+
+```java
+public List<Store> findAllStores() {
+    return StreamSupport.stream(storeRepository.findAll().spliterator(), false).toList();
+}
+```
+
+Hibernate will query the database in this way:
+* 1 query to select the main entity
+* N queries for the entities linked by the join
+
+In our case we can see the following queries in the logs:
+
+```shell
+Hibernate: select s1_0.id,s1_0.name from store s1_0
+Hibernate: select b1_0.store_id,b1_0.id,b1_0.description,b1_0.isbn_10,b1_0.isbn_13,b1_0.medium_image_url,b1_0.nb_of_pages,b1_0.price,b1_0.rank,b1_0.small_image_url,b1_0.title,b1_0.year_of_publication from book b1_0 where b1_0.store_id=?
+Hibernate: select a1_0.books_id,a1_1.id,a1_1.firstname,a1_1.lastname,a1_1.public_id from book_authors a1_0 join author a1_1 on a1_1.id=a1_0.authors_id where a1_0.books_id=?
+Hibernate: select a1_0.books_id,a1_1.id,a1_1.firstname,a1_1.lastname,a1_1.public_id from book_authors a1_0 join author a1_1 on a1_1.id=a1_0.authors_id where a1_0.books_id=?
+Hibernate: select a1_0.books_id,a1_1.id,a1_1.firstname,a1_1.lastname,a1_1.public_id from book_authors a1_0 join author a1_1 on a1_1.id=a1_0.authors_id where a1_0.books_id=?
+Hibernate: select a1_0.books_id,a1_1.id,a1_1.firstname,a1_1.lastname,a1_1.public_id from book_authors a1_0 join author a1_1 on a1_1.id=a1_0.authors_id where a1_0.books_id=?
+[...]
+```
+
+Unfortunately, that is not the end of it.
+
+Now imagine that your ``Book`` entity is related to another one in an EAGER way.
+
+```java
+@ManyToMany(fetch = FetchType.EAGER)
+private List<Author> authors;
+```
+
+You will then execute additional SQL queries.
+
+For instance, in this case:
+
+```shell
+Hibernate: select b1_0.authors_id,b1_1.id,b1_1.description,b1_1.isbn_10,b1_1.isbn_13,b1_1.medium_image_url,b1_1.nb_of_pages,b1_1.price,b1_1.rank,b1_1.small_image_url,s1_0.id,s1_0.name,b1_1.title,b1_1.year_of_publication from book_authors b1_0 join book b1_1 on b1_1.id=b1_0.books_id left join store s1_0 on s1_0.id=b1_1.store_id where b1_0.authors_id=?
+```
+
+{{< admonition tip "To sum up" true >}}
+Just as SQL joins can be really time-consuming, the way you link entities can strongly impact the performance of your application, both in memory and while running SQL queries.
+{{< /admonition >}}
+
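+## Control the number of queries issued for EAGER/SELECT relations using ``@BatchSize``
+We can easily reduce the number of SELECT queries issued while fetching related entities with Hibernate's ``@BatchSize`` annotation. Instead of one query per owning entity, Hibernate then loads the collections of up to ``size`` owners in a single query: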
+
+```java
+@Entity
+public class Store{
+[...]
+    @OneToMany(fetch = FetchType.EAGER,mappedBy = "store")
+    @BatchSize(size = 5)
+    private List<Book> books;
+[...]
+```
+
+TODO LOGS
+
+### Use a dedicated entity graph
+If you are still struggling with the way Hibernate loads your entity graph, you can also specify the graph of entities to load yourself.
+It can be really useful if you want to avoid retrieving attributes you don't need and which make your queries really slow.
+
+[JPA 2.1 has introduced this feature](https://docs.oracle.com/javaee/7/tutorial/persistence-entitygraphs001.htm).
+
+Let's go back to our application.
+Imagine that in one use case, when we fetch a list of books, we don't need the list of authors.
+Using this API we can avoid fetching it in this way
+
+TODO CODE
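+
+One possible way to express this with Spring Data JPA is the ``@EntityGraph`` annotation on a repository method. The sketch below is only an illustration: the repository name, the derived query method and the ``Long`` identifier type are assumptions, and it assumes ``authors`` is mapped as LAZY, because whether an association mapped as EAGER is really skipped depends on the JPA provider and its version.
+
+```java
+import java.util.List;
+
+import org.springframework.data.jpa.repository.EntityGraph;
+import org.springframework.data.repository.CrudRepository;
+
+// Hypothetical repository, not taken from the article's code base.
+public interface BookRepository extends CrudRepository<Book, Long> {
+
+    // Fetch graph: the listed paths are loaded together with the books.
+    // "authors" is not listed, so it is not fetched by this query
+    // (assuming it is mapped as LAZY).
+    @EntityGraph(attributePaths = {"store"})
+    List<Book> findByTitleContaining(String title);
+}
+```
+
+It is worth checking the generated SQL in the logs to confirm that the query on the authors has actually disappeared.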
+
+### Create a dedicated entity to reduce the number of attributes
+
+We often forget that we don't need to map all the columns in an entity!
+For instance, if your table has 30 columns and you only need 10 in your use case, why query, fetch and store all of this data in memory?
+
+That's why I usually recommend having, when it's relevant, a dedicated entity for specific use cases.
+It can be lighter than the _regular_ one and improve the performance of your application.
+
+For instance
+
+TODO CODE
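+
+A minimal sketch of such a lightweight entity, mapped on the same ``book`` table as the regular ``Book`` entity. The class name, the selected attributes and their Java types are assumptions made for the illustration (the column names come from the logs above), and Hibernate's ``@Immutable`` is used here to keep this mapping read-only:
+
+```java
+import java.math.BigDecimal;
+
+import jakarta.persistence.Entity;
+import jakarta.persistence.Id;
+import jakarta.persistence.Table;
+
+import org.hibernate.annotations.Immutable;
+
+// Hypothetical read-only entity exposing only a few columns of "book".
+@Entity
+@Immutable
+@Table(name = "book")
+public class BookSummary {
+
+    @Id
+    private Long id;          // assumption: identifier type
+
+    private String title;
+
+    private BigDecimal price; // assumption: Java type of the "price" column
+
+    public Long getId() {
+        return id;
+    }
+
+    public String getTitle() {
+        return title;
+    }
+
+    public BigDecimal getPrice() {
+        return price;
+    }
+}
+```
+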
+{{< admonition warning "Think about data consistency" true >}}
+Think about the overall consistency of the data stored in your database!
+Keep it in mind when you omit specific joins or columns.
+{{< /admonition >}}
+
+### Use JOIN FETCH in your queries
+
+Another strategy is to _manually_ control the joins and the _fetching_ mode in your queries.
+
+To do that, you can use the ``JOIN FETCH`` clause.
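+
+For instance, with Spring Data JPA, a JPQL query declared with ``@Query`` can load the stores and their books in a single SQL statement. This is a sketch under assumptions: the repository, the method name and the ``Long`` identifier type are hypothetical.
+
+```java
+import java.util.List;
+
+import org.springframework.data.jpa.repository.Query;
+import org.springframework.data.repository.CrudRepository;
+
+// Hypothetical repository, not taken from the article's code base.
+public interface StoreRepository extends CrudRepository<Store, Long> {
+
+    // JOIN FETCH loads each store together with its books in one query,
+    // instead of one additional query per store.
+    @Query("select distinct s from Store s left join fetch s.books")
+    List<Store> findAllWithBooks();
+}
+```
+
+Relations that are still mapped as EAGER further down the graph, such as the authors of a book in the example above, can still trigger their own queries.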
+
+Use a DTO or a tuple
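+
+As a rough sketch of the DTO approach, a JPQL constructor expression can select only the needed columns. The record, the repository, the ``com.example`` package and the ``Long`` identifier type are all assumptions made for the illustration:
+
+```java
+// BookTitle.java (hypothetical DTO in a hypothetical com.example package)
+package com.example;
+
+public record BookTitle(Long id, String title) {
+}
+```
+
+```java
+// BookTitleRepository.java (hypothetical repository in the same package as Book)
+package com.example;
+
+import java.util.List;
+
+import org.springframework.data.jpa.repository.Query;
+import org.springframework.data.repository.CrudRepository;
+
+public interface BookTitleRepository extends CrudRepository<Book, Long> {
+
+    // Only id and title are selected; no Book entity is loaded or managed.
+    @Query("select new com.example.BookTitle(b.id, b.title) from Book b")
+    List<BookTitle> findAllTitles();
+}
+```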
+
+Use ``@Transactional(readOnly = true)``
+
+Pagination w/ Spring Data
+Slice vs Page
+https://stackoverflow.com/questions/49918979/page-vs-slice-when-to-use-which
+https://stackoverflow.com/questions/12644749/way-to-disable-count-query-from-pagerequest-for-getting-total-pages
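+
+To illustrate the difference, here is a sketch of a repository method returning a ``Slice``. Unlike ``Page``, a ``Slice`` does not need the total number of elements, so no extra count query is issued. The repository name and the ``Long`` identifier type are assumptions.
+
+```java
+import org.springframework.data.domain.Pageable;
+import org.springframework.data.domain.Slice;
+import org.springframework.data.repository.CrudRepository;
+
+// Hypothetical repository, not taken from the article's code base.
+public interface BookSliceRepository extends CrudRepository<Book, Long> {
+
+    // A Slice only knows whether a next slice exists,
+    // so Spring Data does not run a count query.
+    Slice<Book> findAllBy(Pageable pageable);
+}
+```
+
+A caller could then request the first 20 books with ``findAllBy(PageRequest.of(0, 20))`` and use ``hasNext()`` to decide whether to fetch more.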
+
+@Cacheable
+
+Use a sql view
+Last but not least use a good old SQL query
+
+## Conclusion
+https://blog.ippon.tech/boost-the-performance-of-your-spring-data-jpa-application
diff --git a/static/assets/images/2024/03/tobias-fischer-PkbZahEG2Ng-unsplash.webp b/static/assets/images/2024/03/tobias-fischer-PkbZahEG2Ng-unsplash.webp
diff --git a/themes/DoIt b/themes/DoIt
+10 −1		assets/data/social.yml
+1 −0		assets/js/lib/giscus.js
+1 −1		assets/lib/artalk/Artalk.css
+10 −1		assets/lib/artalk/Artalk.js
+1 −1		assets/lib/artalk/ArtalkLite.css
+10 −1		assets/lib/artalk/ArtalkLite.js
+45 −45		assets/lib/echarts/echarts.min.js
+2 −2		assets/lib/echarts/macarons.min.js
+1 −1		assets/lib/twikoo/twikoo.all.min.js
+1 −1		assets/lib/twikoo/twikoo.min.js
+1 −1		assets/lib/waline/waline.css
+71 −1		assets/lib/waline/waline.js
+4 −4		dependencies.json
+12 −3		exampleSite/config/_default/params.toml
+1 −1		exampleSite/content/posts/how-to-DoIt/create-diagrams/index.en.md
+1 −1		exampleSite/content/posts/how-to-DoIt/create-diagrams/index.zh-cn.md
+11 −5		exampleSite/content/posts/theme-documentation-basics/index.en.md
+9 −2		exampleSite/content/posts/theme-documentation-basics/index.zh-cn.md
+2 −2		layouts/_default/_markup/render-image.html
+2 −19		layouts/_default/summary.html
+1 −0		layouts/partials/comment.html
+17 −0		layouts/partials/function/getImage.html
+18 −0		layouts/partials/function/imageHandler.html
+1 −0		layouts/partials/function/isUrlRemote.html
+1 −5		layouts/partials/function/resource.html
+14 −11		layouts/partials/head/seo.html
+1 −1		layouts/partials/plugin/compatibility.html
+36 −29		layouts/partials/plugin/image.html
+1 −17		layouts/posts/single.html
+2 −0		layouts/shortcodes/image.html
+189 −147		package-lock.json
+6 −6		package.json