You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is a simple and incomplete package for doing question and answering from
9
-
documents. It uses [gpt-index](https://github.com/jerryjliu/gpt_index) to
10
-
embed and search documents and [langchain](https://github.com/hwchase17/langchain) to
9
+
PDFs or text files (open an issue for more formats). It uses [OpenAI Embeddings](https://platform.openai.com/docs/guides/embeddings) with a vector DB called [FAISS](https://github.com/facebookresearch/faiss) to embed and search documents. [langchain](https://github.com/hwchase17/langchain) helps
11
10
generate answers.
12
11
13
12
It uses this process
@@ -18,6 +17,13 @@ embed docs into vectors -> embed query into vector -> search for top k passages
18
17
create summary of each passage relevant to query -> put summaries into prompt -> generate answer
19
18
```
20
19
20
+
## What's New (v0.0.5)
21
+
22
+
- Replaced gpt-index since we were doing some custom metadata
23
+
- Now have page numbers directly in references
24
+
- You can now load very large PDFs
25
+
- Focusing now only on txt and PDFs to get better reading capabilities
26
+
21
27
## Example
22
28
23
29
Question: How can carbon nanotubes be manufactured at a large scale?
0 commit comments