Use TensorFlow.js for the query and the LLM.
Use a precomputed vector space for searching.
Use local storage for customization.
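
As a rough illustration of how the search and customization notes could fit together, here is a minimal sketch in TypeScript. It assumes the query has already been turned into a numeric vector (for example by a model running in TensorFlow.js, which is not shown), that the precomputed vector space is a plain array of entries, and that the only customization is how many results to return. The names IndexedDoc, SearchPrefs, PREFS_KEY, search, loadPrefs, and savePrefs are hypothetical, and cosine similarity is an assumed choice of ranking metric, not something these notes specify.

```typescript
// Hypothetical shape of one entry in the precomputed vector space.
interface IndexedDoc {
  id: string;
  vector: number[];
}

// Cosine similarity between two vectors of equal length.
// Returns 0 when either vector has zero magnitude.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error("vector length mismatch");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) {
    return 0;
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Brute-force search: score every precomputed vector against the query
// and keep the topK highest-scoring entries.
function search(
  query: number[],
  index: IndexedDoc[],
  topK: number
): { id: string; score: number }[] {
  return index
    .map((doc) => ({ id: doc.id, score: cosineSimilarity(query, doc.vector) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, topK);
}

// Minimal subset of the Web Storage API, so the same code works with
// window.localStorage in a browser or with an in-memory stand-in elsewhere.
interface KeyValueStore {
  getItem(key: string): string | null;
  setItem(key: string, value: string): void;
}

// Hypothetical user preferences and storage key (assumptions for this sketch).
interface SearchPrefs {
  topK: number;
}
const PREFS_KEY = "searchPrefs";
const DEFAULT_PREFS: SearchPrefs = { topK: 3 };

// Load preferences, falling back to defaults if nothing is stored
// or the stored value is not valid JSON.
function loadPrefs(store: KeyValueStore): SearchPrefs {
  const raw = store.getItem(PREFS_KEY);
  if (raw === null) {
    return DEFAULT_PREFS;
  }
  try {
    return { ...DEFAULT_PREFS, ...JSON.parse(raw) };
  } catch {
    return DEFAULT_PREFS;
  }
}

// Persist preferences as JSON under the hypothetical key.
function savePrefs(store: KeyValueStore, prefs: SearchPrefs): void {
  store.setItem(PREFS_KEY, JSON.stringify(prefs));
}
```

In a browser, window.localStorage can be passed directly as the store, since it provides getItem and setItem. A call such as search(queryVector, index, loadPrefs(window.localStorage).topK) would then rank the precomputed entries using the stored preference. A linear scan like this is likely adequate for a small index; a larger one would probably need an approximate nearest-neighbour structure instead.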