Skip to content

Commit ea16140

Browse files
committed
Merge branch 'main' of github.com:whitead/paper-qa into main
2 parents ea6b939 + cf51c11 commit ea16140

File tree

1 file changed

+8
-2
lines changed

1 file changed

+8
-2
lines changed

README.md

Lines changed: 8 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -6,8 +6,7 @@
66
[![MIT license](https://img.shields.io/badge/License-MIT-blue.svg)](https://lbesson.mit-license.org/)
77

88
This is a simple and incomplete package for doing question and answering from
9-
documents. It uses [gpt-index](https://github.com/jerryjliu/gpt_index) to
10-
embed and search documents and [langchain](https://github.com/hwchase17/langchain) to
9+
PDFs or text files (open an issue for more formats). It uses [OpenAI Embeddings](https://platform.openai.com/docs/guides/embeddings) with a vector DB called [FAISS](https://github.com/facebookresearch/faiss) to embed and search documents. [langchain](https://github.com/hwchase17/langchain) helps
1110
generate answers.
1211

1312
It uses this process
@@ -18,6 +17,13 @@ embed docs into vectors -> embed query into vector -> search for top k passages
1817
create summary of each passage relevant to query -> put summaries into prompt -> generate answer
1918
```
2019

20+
## What's New (v0.0.5)
21+
22+
- Replaced gpt-index since we were doing some custom metadata
23+
- Now have page numbers directly in references
24+
- You can now load very large PDFs
25+
- Focusing now only on txt and PDFs to get better reading capabilities
26+
2127
## Example
2228

2329
Question: How can carbon nanotubes be manufactured at a large scale?

0 commit comments

Comments
 (0)