This document describes a proposed algorithm for extracting information from PDF documents using natural language processing (NLP). The algorithm aims to automate the extraction of key data like company metrics and financial details that analysts currently extract manually. It involves identifying keywords, extracting text and tables using rule-based filters, and presenting the extracted information in a structured format like a table. The algorithm is intended to simplify the information extraction process and make it scalable for large documents. It provides a framework that can be modified based on user needs and categories of interest.