Llamaindex ChatBot

What is a chatbot

A chatbot answers the questions a user asks. Below are some examples that use a chatbot.

Examples: simple example; system prompt example; template example; llama-parser, faiss example

AI / NLP / llama index · 2024-03-19

Llamaindex RAG

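What is RAG

RAG stands for Retrieval Augmented Generation, a technique for getting better responses out of a language model. It grounds the response in additional data retrieved for the query, which improves the quality of the answer. Below are some examples that use RAG.

Examples: simple example; SentenceWindowNodeParser example; llama-parser example; llama-parser, faiss example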
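The retrieve-then-augment idea can be sketched without any library. This is a minimal illustration, not LlamaIndex code: `retrieve` and `build_prompt` are hypothetical helpers, relevance is plain word overlap, and the final call to a language model is left out.

```python
import re
from typing import List


def tokenize(text: str) -> set:
    # Lowercase word set; a real system would use embeddings instead.
    return set(re.findall(r"\w+", text.lower()))


def retrieve(query: str, documents: List[str], top_k: int = 2) -> List[str]:
    # Rank documents by how many query words they share, keep the best top_k.
    query_terms = tokenize(query)
    ranked = sorted(
        documents,
        key=lambda doc: len(query_terms & tokenize(doc)),
        reverse=True,
    )
    return ranked[:top_k]


def build_prompt(query: str, context_docs: List[str]) -> str:
    # "Augmentation": the retrieved text is placed in front of the question.
    context = "\n".join(f"- {doc}" for doc in context_docs)
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"


documents = [
    "LlamaIndex builds an index over your documents.",
    "Ollama runs language models locally.",
    "A retriever pulls relevant nodes out of an index.",
]
query = "what does a retriever do with an index"
context = retrieve(query, documents, top_k=2)
prompt = build_prompt(query, context)
print(prompt)
```

The prompt printed at the end is what would be sent to the model, so the answer can draw on the retrieved lines rather than on the model's parameters alone.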

AI / NLP / llama index · 2024-03-18

Llamaindex retriever

What is a retriever

A retriever plays a role similar to a search engine: given a query, it extracts the related content from the values stored in an index.

How to use a retriever

The simplest usage looks like this.

{% highlight python %}
retriever = index.as_retriever()
nodes = retriever.retrieve("{question}")
{% endhighlight %}

How to use a retriever (advanced)

There are also more advanced ways to use a retriever. This approach configures the retriever in detail for each type of index; refer to retriever modes to build a variety of retrievers.

{% highlight python %}
retriever = summary_index.as_retriever(
    retriever_mode="llm",
    choice_batch_size=5,
)
{% endhighlight %}
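To make the search-engine analogy concrete, here is a small stand-in retriever. It is only a sketch: `ToyRetriever`, `ScoredNode`, and the `similarity_top_k` argument are hypothetical names for this example, and the Jaccard word overlap used for scoring is an assumption rather than what LlamaIndex does internally.

```python
import re
from dataclasses import dataclass
from typing import List


@dataclass
class ScoredNode:
    text: str
    score: float


class ToyRetriever:
    """Hypothetical stand-in for a retriever; not part of LlamaIndex."""

    def __init__(self, texts: List[str], similarity_top_k: int = 2) -> None:
        self._texts = texts
        self._top_k = similarity_top_k

    @staticmethod
    def _terms(text: str) -> set:
        return set(re.findall(r"\w+", text.lower()))

    def retrieve(self, query: str) -> List[ScoredNode]:
        query_terms = self._terms(query)
        scored = []
        for text in self._texts:
            terms = self._terms(text)
            union = query_terms | terms
            # Jaccard similarity: shared words divided by all distinct words.
            score = len(query_terms & terms) / len(union) if union else 0.0
            scored.append(ScoredNode(text, score))
        scored.sort(key=lambda node: node.score, reverse=True)
        # Keep the best matches and drop texts that share nothing with the query.
        return [node for node in scored[: self._top_k] if node.score > 0]


retriever = ToyRetriever(
    [
        "nodes are chunks of a document",
        "an index stores nodes for fast lookup",
        "ollama runs models locally",
    ]
)
nodes = retriever.retrieve("how does an index store nodes")
for node in nodes:
    print(round(node.score, 3), node.text)
```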

AI / NLP / llama index · 2024-03-15

Llamaindex pipeline

AI / NLP / llama index · 2024-03-14

Llamaindex embedding

What is embedding

An embedding represents an input document or node as a vector. With vectors, the similarity between documents can be computed (for example with cosine similarity), which lets documents be used efficiently. LlamaIndex uses cosine similarity by default, and a variety of embeddings can be plugged in as shown below.

With OpenAI

To use the OpenAI embeddings, proceed as follows. Note that this is a paid service.

{% highlight shell %}
pip install llama-index-embeddings-openai
{% endhighlight %}

{% highlight python %}
import os

from llama_index.core import Settings, VectorStoreIndex
from llama_index.embeddings.openai import OpenAIEmbedding

OPENAI_API_TOKEN = "sk-"
os.environ["OPENAI_API_KEY"] = OPENAI_API_TOKEN

embed_model = OpenAIEmbedding(embed_batch_size=42)  # default is 10

# global
Settings.embed_model = embed_model

# per-index (documents must already be loaded)
index = VectorStoreIndex.from_documents(documents, embed_model=embed_model)
{% endhighlight %}

With hugging face

Embedding with hugging face works as follows.

{% highlight shell %}
pip install llama-index-embeddings-huggingface
{% endhighlight %}

{% highlight python %}
from llama_index.embeddings.huggingface import HuggingFaceEmbedding
from llama_index.core import Settings

Settings.embed_model = HuggingFaceEmbedding(
    model_name="BAAI/bge-small-en-v1.5"
)
{% endhighlight %}

With hugging face (with ONNX)

Hugging face models can be used through ONNX as follows.

{% highlight shell %}
pip install transformers optimum[exporters]
pip install llama-index-embeddings-huggingface-optimum
{% endhighlight %}

{% highlight python %}
from llama_index.core import Settings
from llama_index.embeddings.huggingface_optimum import OptimumEmbedding

# Export the model to ONNX once, then load it from the saved folder.
OptimumEmbedding.create_and_save_optimum_model(
    "BAAI/bge-small-en-v1.5", "./bge_onnx"
)
Settings.embed_model = OptimumEmbedding(folder_name="./bge_onnx")
{% endhighlight %}

With langchain

The various embeddings supported by langchain can be used as well.

langchain embeddings list

{% highlight shell %}
pip install llama-index-embeddings-langchain
{% endhighlight %}

{% highlight python %}
from langchain.embeddings.huggingface import HuggingFaceBgeEmbeddings
from llama_index.core import Settings

Settings.embed_model = HuggingFaceBgeEmbeddings(model_name="BAAI/bge-base-en")
{% endhighlight %}

With custom embedding

To build and use your own embedding beyond the ones listed above, you can do the following.

{% highlight python %}
from typing import Any, List

from InstructorEmbedding import INSTRUCTOR
from llama_index.core.embeddings import BaseEmbedding


class InstructorEmbeddings(BaseEmbedding):
    def __init__(
        self,
        instructor_model_name: str = "hkunlp/instructor-large",
        instruction: str = "Represent the Computer Science documentation or question:",
        **kwargs: Any,
    ) -> None:
        self._model = INSTRUCTOR(instructor_model_name)
        self._instruction = instruction
        super().__init__(**kwargs)

    def _get_query_embedding(self, query: str) -> List[float]:
        embeddings = self._model.encode([[self._instruction, query]])
        return embeddings[0]

    def _get_text_embedding(self, text: str) -> List[float]:
        embeddings = self._model.encode([[self._instruction, text]])
        return embeddings[0]

    def _get_text_embeddings(self, texts: List[str]) -> List[List[float]]:
        embeddings = self._model.encode(
            [[self._instruction, text] for text in texts]
        )
        return embeddings

    # The async variants simply delegate to the sync implementations.
    async def _aget_query_embedding(self, query: str) -> List[float]:
        return self._get_query_embedding(query)

    async def _aget_text_embedding(self, text: str) -> List[float]:
        return self._get_text_embedding(text)
{% endhighlight %}

Other embeddings

Many other embeddings are available as well; the supported ones are listed here: embeddings list
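As a small worked example of the cosine similarity mentioned at the top of this post, the sketch below ranks a few hand-written vectors against a query vector. The three-dimensional vectors and their names are made up for illustration; real embedding models return vectors with hundreds of dimensions.

```python
import math
from typing import List


def cosine_similarity(a: List[float], b: List[float]) -> float:
    # cos(a, b) = (a . b) / (|a| * |b|)
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical document embeddings.
doc_vectors = {
    "cats": [1.0, 0.0, 1.0],
    "dogs": [1.0, 0.1, 0.9],
    "stocks": [0.0, 1.0, 0.0],
}
query_vector = [1.0, 0.0, 1.0]

# Sort document names from most to least similar to the query.
ranked = sorted(
    doc_vectors,
    key=lambda name: cosine_similarity(query_vector, doc_vectors[name]),
    reverse=True,
)
print(ranked)
```

Because cosine similarity depends only on direction, scaling a vector does not change its score; only the angle between vectors matters.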

AI / NLP / llama index · 2024-03-13

Llamaindex index

What is an index

An index is a structure for fast lookup in retrieval setups such as RAG. It can also be used for QA, as in a chatbot.

Vector store index

This is the most commonly used indexing method: it indexes through a vector store. There are two ways to build it, directly from documents or from nodes.

{% highlight python %}
from llama_index.core import VectorStoreIndex

index = VectorStoreIndex.from_documents(documents)
{% endhighlight %}

{% highlight python %}
from llama_index.core import VectorStoreIndex
from llama_index.core.schema import TextNode

node1 = TextNode(text="", id_="")
node2 = TextNode(text="", id_="")
nodes = [node1, node2]
index = VectorStoreIndex(nodes)
{% endhighlight %}

Besides the default vector store, various custom vector stores can be used. Below is a simple example.

{% highlight python %}
import pinecone
from llama_index.core import (
    VectorStoreIndex,
    SimpleDirectoryReader,
    StorageContext,
)
from llama_index.vector_stores.pinecone import PineconeVectorStore

# init pinecone
pinecone.init(api_key="", environment="")
pinecone.create_index(
    "quickstart", dimension=1536, metric="euclidean", pod_type="p1"
)

# construct vector store and customize storage context
storage_context = StorageContext.from_defaults(
    vector_store=PineconeVectorStore(pinecone.Index("quickstart"))
)

# Load documents and build index
documents = SimpleDirectoryReader(
    "../../examples/data/paul_graham"
).load_data()
index = VectorStoreIndex.from_documents(
    documents, storage_context=storage_context
)
{% endhighlight %}

Other index guides

The vector store is the most common indexing method, but a variety of other methods exist as well: other index guides

With other embedding module

By default the index runs with the embedding that LlamaIndex provides; to use a different embedding, see: embedding module

Pipeline

After going through documents advance(1) and nodes advance(1), a pipeline can be introduced as follows: document node index pipeline
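To illustrate what a vector store index does conceptually, here is a tiny in-memory version. It is a sketch under stated assumptions: `ToyVectorIndex` is a hypothetical class, the vectors are supplied by hand instead of by an embedding model, and the search is a linear scan rather than the approximate search a production vector store would use.

```python
import math
from typing import Dict, List, Tuple


class ToyVectorIndex:
    """Hypothetical in-memory index; not LlamaIndex's VectorStoreIndex."""

    def __init__(self) -> None:
        self._vectors: Dict[str, List[float]] = {}

    def add(self, node_id: str, vector: List[float]) -> None:
        # Normalize once at insert time so a query only needs dot products.
        norm = math.sqrt(sum(x * x for x in vector))
        if norm == 0:
            raise ValueError("cannot index a zero vector")
        self._vectors[node_id] = [x / norm for x in vector]

    def query(self, vector: List[float], top_k: int = 1) -> List[Tuple[str, float]]:
        norm = math.sqrt(sum(x * x for x in vector))
        if norm == 0:
            return []
        unit = [x / norm for x in vector]
        scored = [
            (node_id, sum(a * b for a, b in zip(unit, stored)))
            for node_id, stored in self._vectors.items()
        ]
        scored.sort(key=lambda pair: pair[1], reverse=True)
        return scored[:top_k]


index = ToyVectorIndex()
index.add("node-1", [1.0, 0.0])
index.add("node-2", [0.0, 1.0])
index.add("node-3", [1.0, 1.0])

results = index.query([0.9, 0.1], top_k=2)
print(results)
```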

AI / NLP / llama index · 2024-03-12

Llamaindex nodes Advance(1)

AI / NLP / llama index · 2024-03-11

Llamaindex nodes

What are nodes

A node is a chunk (text, image, and so on) that a document is split into. Each generated node carries metadata and relationship information.

How to use nodes (with documents)

To use nodes this way, you first need to be able to work with documents. See the following link: documents

The simplest way to create nodes from documents is as follows.

{% highlight python %}
from llama_index.core.node_parser import SentenceSplitter

parser = SentenceSplitter()
nodes = parser.get_nodes_from_documents(documents)
{% endhighlight %}

How to use nodes (custom text)

You can also build nodes manually from individual texts (advanced).

{% highlight python %}
from llama_index.core.schema import TextNode, NodeRelationship, RelatedNodeInfo

node1 = TextNode(text="", id_="")
node2 = TextNode(text="", id_="")

# set relationships
node1.relationships[NodeRelationship.NEXT] = RelatedNodeInfo(
    node_id=node2.node_id
)
node2.relationships[NodeRelationship.PREVIOUS] = RelatedNodeInfo(
    node_id=node1.node_id
)
nodes = [node1, node2]
{% endhighlight %}

Parent-child information between nodes can also be added as follows.

{% highlight python %}
node2.relationships[NodeRelationship.PARENT] = RelatedNodeInfo(
    node_id=node1.node_id, metadata={"key": "val"}
)
{% endhighlight %}

A node's id can be set directly as follows. This id can serve various purposes.

{% highlight python %}
node.node_id = "My new node_id!"
{% endhighlight %}

Advanced

nodes advance(1)
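The idea of splitting a document into chunks that carry metadata and relationships can be shown with plain Python. This is only an illustration: `ToyNode` and `split_into_nodes` are hypothetical, and the fixed word-count chunking is an assumption, not how `SentenceSplitter` works.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class ToyNode:
    node_id: str
    text: str
    metadata: Dict[str, str] = field(default_factory=dict)
    relationships: Dict[str, str] = field(default_factory=dict)


def split_into_nodes(text: str, chunk_size: int, source: str) -> List[ToyNode]:
    # Cut the text into chunks of chunk_size words, one node per chunk.
    words = text.split()
    nodes = [
        ToyNode(
            node_id=f"{source}-{i}",
            text=" ".join(words[start : start + chunk_size]),
            metadata={"source": source},
        )
        for i, start in enumerate(range(0, len(words), chunk_size))
    ]
    # Link neighbours so each node knows what comes before and after it.
    for prev_node, next_node in zip(nodes, nodes[1:]):
        prev_node.relationships["next"] = next_node.node_id
        next_node.relationships["previous"] = prev_node.node_id
    return nodes


nodes = split_into_nodes("one two three four five", chunk_size=2, source="doc")
for node in nodes:
    print(node.node_id, repr(node.text), node.relationships)
```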

AI / NLP / llama index · 2024-03-08

Llamaindex documents Advance(1)

Document loaders

Flat document

Document loaders can read files in many different formats, but they can also read plain files. For a plain file, a simple reader is provided.

{% highlight python %}
from llama_index.readers.file import FlatReader
from pathlib import Path

md_docs = FlatReader().load_data(Path("./test.md"))
{% endhighlight %}

Other document loaders

other document loader

Metadata extraction usage pattern

Metadata can be extracted with an LLM as follows.

{% highlight shell %}
pip install llama-index-extractors-entity
{% endhighlight %}

{% highlight python %}
import os

from llama_index.core.extractors import (
    TitleExtractor,
    QuestionsAnsweredExtractor,
    SummaryExtractor,
    KeywordExtractor,
    BaseExtractor,
)
from llama_index.extractors.entity import EntityExtractor
from llama_index.llms.openai import OpenAI

OPENAI_API_TOKEN = "sk-"
os.environ["OPENAI_API_KEY"] = OPENAI_API_TOKEN

llm = OpenAI(temperature=0.1, model="gpt-3.5-turbo", max_tokens=512)


class CustomExtractor(BaseExtractor):
    def extract(self, nodes):
        # Combine metadata written by earlier extractors into one custom field.
        metadata_list = [
            {
                "custom": (
                    node.metadata["document_title"]
                    + "\n"
                    + node.metadata["excerpt_keywords"]
                )
            }
            for node in nodes
        ]
        return metadata_list


title_extractor = TitleExtractor(nodes=5)
qa_extractor = QuestionsAnsweredExtractor(questions=3)
summary_extractor = SummaryExtractor(summaries=["prev", "self", "next"])
keyword_extractor = KeywordExtractor(keywords=10, llm=llm)
custom_extractor = CustomExtractor()
entity_extractor = EntityExtractor(
    prediction_threshold=0.5,
    label_entities=False,  # include the entity label in the metadata (can be erroneous)
    device="cpu",  # set to "cuda" if you have a GPU
)
{% endhighlight %}

Pipeline

After going through nodes advance(1), a pipeline can be introduced as follows: document node pipeline
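The shape of what an extractor returns (one metadata dictionary per node) can be seen without calling an LLM. The sketch below is a hypothetical frequency-based keyword extractor: `extract_keywords`, `extract`, the stop-word list, and the dictionary-based nodes are all assumptions made for the example, and the `excerpt_keywords` key is borrowed from the custom extractor shown above.

```python
import re
from collections import Counter
from typing import Dict, List

# Tiny stop-word list, just enough for the example.
STOPWORDS = {"the", "a", "an", "of", "and", "to", "is", "in", "by"}


def extract_keywords(text: str, keywords: int = 3) -> List[str]:
    # Count the remaining words and keep the most frequent ones.
    words = [
        word
        for word in re.findall(r"[a-z]+", text.lower())
        if word not in STOPWORDS
    ]
    return [word for word, _ in Counter(words).most_common(keywords)]


def extract(nodes: List[Dict[str, str]]) -> List[Dict[str, str]]:
    # One metadata dict per node, in the same order as the input nodes.
    return [
        {"excerpt_keywords": ", ".join(extract_keywords(node["text"]))}
        for node in nodes
    ]


nodes = [
    {"text": "The index stores vectors. The index is queried by a retriever."},
    {"text": "Ollama runs models locally and Ollama lists models."},
]
metadata_list = extract(nodes)
for metadata in metadata_list:
    print(metadata)
```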

AI / NLP / llama index · 2024-03-07

Llamaindex documents

AI / NLP / llama index · 2024-03-06

Llamaindex intro

AI / NLP / llama index · 2024-03-05

Hugging face intro

How to start

First, sign up for Hugging Face: Hugging face. After signing up, the following guidance is shown.

Authentication

After signing up on the website you must verify your email. Once verification is complete, you can set up an organization as shown below: join an existing organization or create one yourself. After email verification, open Authentication under settings to configure it as shown below. 2FA can be set up with the authenticator app provided by Google.

Create personal repository

Repositories can be managed on the website, but they can also be managed through the CLI as shown below. Under Access Tokens in the website settings you can create a token. Tokens come in two kinds, read and write: use read when pulling from the server and write when publishing to it.

{% highlight shell %}
pip install huggingface_hub
huggingface-cli login
huggingface-cli repo create --type {model, dataset, space}
{% endhighlight %}

Use personal repository

To use a personal repository, clone it as shown below and work with it the same way as with git.

{% highlight shell %}
git lfs install
git clone https://huggingface.co//
{% endhighlight %}

Use hugging face model

To use Hugging Face from code, follow the form below. For details, check the page of the organization that uploaded each model and tokenizer.

{% highlight python %}
from transformers import AutoModelForCausalLM, AutoTokenizer

REPO_ID = ""
FILENAME = ""
model_id = f"{REPO_ID}/{FILENAME}"

model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)
{% endhighlight %}

AI / NLP / hugging face · 2024-03-04

Ollama intro

How to start

Ollama has to be installed first, so start by installing the build that matches your OS from the link below. Once ollama is installed, pull the model you want to use as follows.

{% highlight shell %}
ollama pull
{% endhighlight %}

The models available for download are listed on the following site: ollama

Check installed model

To list the installed models:

{% highlight shell %}
ollama list
{% endhighlight %}

Check installed model info

To inspect the details of an installed model:

{% highlight shell %}
ollama show {--license, --modelfile, --parameters, --system, --template}
{% endhighlight %}

Copy installed model

To copy an installed model:

{% highlight shell %}
ollama cp
{% endhighlight %}

Run model in CLI

To run an installed model from the CLI:

{% highlight shell %}
ollama run
{% endhighlight %}

Remove installed model

To remove an installed model:

{% highlight shell %}
ollama rm
{% endhighlight %}

AI / NLP / ollama · 2024-03-01

2. Sequence embedding

Seq2Seq
- An encoder-decoder model that maps an input sequence to an output sequence

ELMo
- Contextual word embeddings taken from a bidirectional LSTM language model

Transformer
- An encoder-decoder architecture built on self-attention instead of recurrence

GPT
- A Transformer decoder pre-trained to predict the next token

BERT
- A Transformer encoder pre-trained with masked language modeling

AI / NLP / basic · 2023-12-12

1. Word embedding

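Word2Vec
- Prediction-based learning that uses a center word and its surrounding words
- Struggles to tell similar words apart
- Strongly affected by word frequency
- Learning a new word requires retraining on the whole corpus
- Vocabulary size has a large effect on training time

CBOW
- Predicts the center word from the surrounding words

Skip-gram
- Predicts the surrounding words from the center word (more training updates)

FastText
- Marks word boundaries with <> and splits each word into n-grams for training
- Trained in a way similar to skip-gram
- Learning subwords makes it possible to learn similar words

GloVe (Global Vectors for Word Representation)
- The earlier LSA (Latent Semantic Analysis) reduces dimensionality based on word frequency in documents -> weak at inferring word meaning
- Proposes a new method that takes word similarity into account
- Window-based co-occurrence matrix: the words that appear before and after each word are tabulated into a matrix
- Co-occurrence probability: the entry for a word pair divided by the total of its row
- Loss function: trains the model so that its output comes close to the co-occurrence probability

konlpy gensim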
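The training inputs described above can be generated in a few lines. This is a sketch for illustration only: the function names are made up, the window size of 1 and n-gram length of 3 are arbitrary choices, and no model is actually trained.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


def skipgram_pairs(tokens: List[str], window: int = 1) -> List[Tuple[str, str]]:
    # Skip-gram: one (center, context) pair per neighbouring word.
    pairs = []
    for i, center in enumerate(tokens):
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        pairs.extend((center, tokens[j]) for j in range(lo, hi) if j != i)
    return pairs


def cbow_examples(tokens: List[str], window: int = 1) -> List[Tuple[List[str], str]]:
    # CBOW: all neighbouring words together predict the center word.
    examples = []
    for i, center in enumerate(tokens):
        lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
        examples.append(([tokens[j] for j in range(lo, hi) if j != i], center))
    return examples


def char_ngrams(word: str, n: int = 3) -> List[str]:
    # FastText-style subwords: wrap the word in < > and slide a window of n chars.
    wrapped = f"<{word}>"
    return [wrapped[i : i + n] for i in range(len(wrapped) - n + 1)]


def cooccurrence_probabilities(
    tokens: List[str], window: int = 1
) -> Dict[str, Dict[str, float]]:
    # GloVe-style statistics: count co-occurrences, then divide by the row total.
    counts: Dict[str, Dict[str, int]] = defaultdict(lambda: defaultdict(int))
    for center, context in skipgram_pairs(tokens, window):
        counts[center][context] += 1
    return {
        word: {ctx: n / sum(row.values()) for ctx, n in row.items()}
        for word, row in counts.items()
    }


tokens = "i like deep learning".split()
print(skipgram_pairs(tokens))
print(cbow_examples(tokens))
print(char_ngrams("where"))
print(cooccurrence_probabilities(tokens)["like"])
```

Skip-gram produces one training pair per neighbouring word while CBOW produces one example per position, which is why skip-gram ends up with more training updates on the same text.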

AI / NLP / basic · 2023-12-11

Dspy intro

How to start

{% highlight python %}
from transformers import AutoModelForCausalLM, AutoTokenizer
{% endhighlight %}

AI / NLP / dspy · 2023-04-02

WTMO-dev
