Loading pre-trained spanBERT from ./pretrained_spanbert ____ Parameters: Client key = XXXXXX Engine key = XXXXXX Gemini key = XXXXXX Method = spanbert Relation = Top_Member_Employees Threshold = 0.7 Query = microsoft bill gates # of Tuples = 10 Loading necessary libraries; This should take a minute or so ...) =========== Iteration: 0 - Query: microsoft bill gates =========== URL ( 1 / 10): https://en.wikipedia.org/wiki/Bill_Gates Fetching text from url ... Trimming webpage content from 146298 to 10000 characters Webpage length (num characters): 10000 Annotating the webpage using spacy... Extracted 49 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Processed 5 / 49 sentences Processed 10 / 49 sentences === Extracted Relation === Input tokens: ['founder', 'of', 'TerraPower', 'Founder', 'of', 'Breakthrough', 'Energy', 'Founder', 'of', 'Gates', 'Ventures', 'Technology', 'advisor', 'of', 'Microsoft', 'Spouse', 'Melinda', 'French', ' ', '('] Output Confidence: 0.78589326 ; Subject: Gates Ventures Technology ; Object: Melinda ; Adding to set of extracted relations ========== Processed 15 / 49 sentences Processed 20 / 49 sentences === Extracted Relation === Input tokens: ['Following', 'Microsoft', "'s", '1986', 'initial', 'public', 'offering', '(', 'IPO', ')', ',', 'Gates', 'became', 'a', 'billionaire', 'in', '1987', ','] Output Confidence: 0.76217777 ; Subject: Microsoft ; Object: Gates ; Adding to set of extracted relations ========== Processed 25 / 49 sentences === Extracted Relation === Input tokens: ['He', 'stepped', 'down', 'as', 'chairman', 'of', 'the', 'board', 'of', 'directors', 'in', '2014', 'and', 'became', 'technology', 'adviser', 'to', 'newly', 'appointed', 'CEO', 'Satya', 'Nadella', 'and', 'other', 'Microsoft', 'leaders', ','] Output Confidence: 0.9918389 ; Subject: Microsoft ; Object: Satya Nadella ; Adding to set of extracted relations ========== Processed 30 / 49 sentences === Extracted Relation === Input tokens: ['Gates', 'and', 'French', 'Gates', 'co', '-', 'chaired', 'the', 'foundation', 'until', '2024', ',', 'when', 'the', 'latter', 'resigned', 'following', 'the', 'couple', "'s", 'divorce', ';', 'it', 'has', 'since', 'been', 'renamed', 'the', 'Gates', 'Foundation', ',', 'with', 'Gates', 'serving', 'as', 'its', 'sole', 'chair', '.'] Output Confidence: 0.9689595 ; Subject: French Gates ; Object: Gates ; Adding to set of extracted relations ========== === Extracted Relation === Input tokens: ['it', 'has', 'since', 'been', 'renamed', 'the', 'Gates', 'Foundation', ',', 'with', 'Gates', 'serving', 'as', 'its', 'sole', 'chair', '.'] Output Confidence: 0.9899277 ; Subject: the Gates Foundation ; Object: Gates ; Adding to set of extracted relations ========== Processed 35 / 49 sentences Processed 40 / 49 sentences Processed 45 / 49 sentences Extracted annotations for 4 out of total 49 sentences Relations extracted from this website: 5 (Overall: 5) URL ( 2 / 10): https://www.instagram.com/thisisbillgates/?hl=en Fetching text from url ... Webpage length (num characters): 0 Annotating the webpage using spacy... Extracted 0 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Extracted annotations for 0 out of total 0 sentences Relations extracted from this website: 0 (Overall: 0) URL ( 3 / 10): https://x.com/billgates Fetching text from url ... Webpage length (num characters): 251 Annotating the webpage using spacy... Extracted 5 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Processed 5 / 5 sentences Extracted annotations for 0 out of total 5 sentences Relations extracted from this website: 0 (Overall: 0) URL ( 4 / 10): https://www.gatesnotes.com/ Fetching text from url ... Webpage length (num characters): 212 Annotating the webpage using spacy... Extracted 2 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Extracted annotations for 0 out of total 2 sentences Relations extracted from this website: 0 (Overall: 0) URL ( 5 / 10): https://www.youtube.com/billgates Fetching text from url ... Webpage length (num characters): 166 Annotating the webpage using spacy... Extracted 1 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Extracted annotations for 0 out of total 1 sentences Relations extracted from this website: 0 (Overall: 0) URL ( 6 / 10): https://news.microsoft.com/2020/03/13/microsoft-announces-change-to-its-board-of-directors/ Fetching text from url ... Webpage length (num characters): 35 Annotating the webpage using spacy... Extracted 1 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Extracted annotations for 0 out of total 1 sentences Relations extracted from this website: 0 (Overall: 0) URL ( 7 / 10): https://www.microsoft.com/ Fetching text from url ... Webpage length (num characters): 2103 Annotating the webpage using spacy... Extracted 5 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Processed 5 / 5 sentences Extracted annotations for 0 out of total 5 sentences Relations extracted from this website: 0 (Overall: 0) URL ( 8 / 10): https://news.microsoft.com/tag/bill-gates/ Fetching text from url ... Webpage length (num characters): 35 Annotating the webpage using spacy... Extracted 1 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Extracted annotations for 0 out of total 1 sentences Relations extracted from this website: 0 (Overall: 0) URL ( 9 / 10): https://answers.microsoft.com/en-us/windows/forum/all/a-short-letter-to-bill-gates/59fdd4d0-6330-40da-9e66-fa6c29836454 Fetching text from url ... Webpage length (num characters): 13 Annotating the webpage using spacy... Extracted 1 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Extracted annotations for 0 out of total 1 sentences Relations extracted from this website: 0 (Overall: 0) URL ( 10 / 10): https://medium.com/@HeathEvans/content-is-king-essay-by-bill-gates-1996-df74552f80d9 Fetching text from url ... Webpage length (num characters): 7246 Annotating the webpage using spacy... Extracted 60 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Processed 5 / 60 sentences Processed 10 / 60 sentences Processed 15 / 60 sentences Processed 20 / 60 sentences Processed 25 / 60 sentences Processed 30 / 60 sentences Processed 35 / 60 sentences Processed 40 / 60 sentences Processed 45 / 60 sentences Processed 50 / 60 sentences Processed 55 / 60 sentences === Extracted Relation === Input tokens: ['MarketingSocial', 'MediaMarketingLeadershipBusiness----13FollowWritten', 'by', 'Heath', 'Evans1.4', 'K', 'Followers·3.3', 'K', 'FollowingMarketing', '&', 'Communications', 'Manager', 'at', 'Melbourne', 'Accelerator', 'Program', '('] Output Confidence: 0.98368984 ; Subject: Melbourne Accelerator Program ; Object: Heath Evans1.4K Followers·3.3K FollowingMarketing & Communications ; Adding to set of extracted relations ========== Processed 60 / 60 sentences Extracted annotations for 1 out of total 60 sentences Relations extracted from this website: 1 (Overall: 1) ================== ALL RELATIONS for org:top_members/employees ( 6 ) ================= Confidence: 0.9918389 | Subject: Microsoft | Object: Satya Nadella Confidence: 0.9899277 | Subject: the Gates Foundation | Object: Gates Confidence: 0.98368984 | Subject: Melbourne Accelerator Program | Object: Heath Evans1.4K Followers·3.3K FollowingMarketing & Communications Confidence: 0.9689595 | Subject: French Gates | Object: Gates Confidence: 0.78589326 | Subject: Gates Ventures Technology | Object: Melinda Confidence: 0.76217777 | Subject: Microsoft | Object: Gates =========== Iteration: 1 - Query: Microsoft Satya Nadella =========== URL ( 1 / 10): https://news.microsoft.com/source/exec/satya-nadella/ Fetching text from url ... Webpage length (num characters): 35 Annotating the webpage using spacy... Extracted 1 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Extracted annotations for 0 out of total 1 sentences Relations extracted from this website: 0 (Overall: 0) URL ( 2 / 10): https://en.wikipedia.org/wiki/Satya_Nadella Fetching text from url ... Trimming webpage content from 31577 to 10000 characters Webpage length (num characters): 10000 Annotating the webpage using spacy... Extracted 69 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Processed 5 / 69 sentences Processed 10 / 69 sentences === Extracted Relation === Input tokens: ['active1992', '–', 'presentTitleChairman', 'and', 'CEO', 'of', 'MicrosoftSpouse', 'Anupama', 'Nadella', '('] Output Confidence: 0.96966493 ; Subject: MicrosoftSpouse ; Object: active1992 ; Adding to set of extracted relations ========== === Extracted Relation === Input tokens: ['active1992', '–', 'presentTitleChairman', 'and', 'CEO', 'of', 'MicrosoftSpouse', 'Anupama', 'Nadella', '(', 'm.', '1992)Children3AwardsPadma', 'Bhushan', '(', '2022)WebsiteMicrosoft', 'profileSignature', 'Satya', 'Narayana', 'Nadella', '('] Output Confidence: 0.97394663 ; Subject: MicrosoftSpouse ; Object: Satya Narayana Nadella ; Adding to set of extracted relations ========== === Extracted Relation === Input tokens: ['of', 'Microsoft', ',', 'succeeding', 'Steve', 'Ballmer', 'in', '2014', 'as', 'CEO[2][3', ']'] Output Confidence: 0.9244462 ; Subject: Microsoft ; Object: Steve Ballmer ; Adding to set of extracted relations ========== === Extracted Relation === Input tokens: ['of', 'Microsoft', ',', 'succeeding', 'Steve', 'Ballmer', 'in', '2014', 'as', 'CEO[2][3', ']', 'and', 'John', 'W.', 'Thompson', 'in', '2021', 'as', 'chairman.[4][5', ']'] Output Confidence: 0.9902232 ; Subject: Microsoft ; Object: John W. Thompson ; Adding to set of extracted relations ========== === Extracted Relation === Input tokens: ['succeeding', 'Steve', 'Ballmer', 'in', '2014', 'as', 'CEO[2][3', ']', 'and', 'John', 'W.', 'Thompson', 'in', '2021', 'as', 'chairman.[4][5', ']', 'Before', 'becoming', 'CEO', ',', 'he', 'was', 'the', 'executive', 'vice', 'president', 'of', 'Microsoft', "'s", 'cloud', 'and', 'enterprise', 'group', ','] Output Confidence: 0.9909373 ; Subject: Microsoft ; Object: Steve Ballmer ; Adding to set of extracted relations ========== === Extracted Relation === Input tokens: ['and', 'John', 'W.', 'Thompson', 'in', '2021', 'as', 'chairman.[4][5', ']', 'Before', 'becoming', 'CEO', ',', 'he', 'was', 'the', 'executive', 'vice', 'president', 'of', 'Microsoft', "'s", 'cloud', 'and', 'enterprise', 'group', ','] Output Confidence: 0.9892769 ; Subject: Microsoft ; Object: John W. Thompson ; Duplicate with lower confidence than existing record. Ignoring this. ========== === Extracted Relation === Input tokens: ['Bukkapuram', 'Nadella', 'Yugandhar', ',', 'was', 'an', 'Indian', 'Administrative', 'Service', 'officer', 'of', 'the', '1962', 'batch.[12][13][9', ']'] Output Confidence: 0.98870194 ; Subject: Indian Administrative Service ; Object: Bukkapuram Nadella Yugandhar ; Adding to set of extracted relations ========== Processed 15 / 69 sentences === Extracted Relation === Input tokens: ['Career', 'Sun', 'Microsystems', 'Nadella', 'worked', 'at', 'Sun', 'Microsystems', 'as', 'a', 'member', 'of', 'its', 'technology', 'staff', 'before', 'joining', 'Microsoft', 'in', '1992.[25', ']'] Output Confidence: 0.96603364 ; Subject: Sun Microsystems ; Object: Nadella ; Adding to set of extracted relations ========== Processed 20 / 69 sentences === Extracted Relation === Input tokens: ['At', 'Microsoft', ',', 'Nadella', 'has', 'led', 'major', 'projects', 'that', 'included', 'the', 'company', "'s", 'move', 'to', 'cloud', 'computing', 'and', 'the', 'development', 'of', 'one', 'of', 'the', 'largest', 'cloud', 'infrastructures', 'in', 'the', 'world.[26', ']', 'Nadella', 'worked', 'as', 'the', 'senior', 'vice', '-'] Output Confidence: 0.9784718 ; Subject: Microsoft ; Object: Nadella ; Adding to set of extracted relations ========== === Extracted Relation === Input tokens: ['Nadella', 'worked', 'as', 'the', 'senior', 'vice', '-', 'president', 'of', 'research', 'and', 'development', '(', 'R&D', ')', 'for', 'the', 'Online', 'Services', 'Division', 'and', 'vice', '-'] Output Confidence: 0.9889404 ; Subject: the Online Services Division ; Object: Nadella ; Adding to set of extracted relations ========== === Extracted Relation === Input tokens: ['Nadella', 'worked', 'as', 'the', 'senior', 'vice', '-', 'president', 'of', 'research', 'and', 'development', '(', 'R&D', ')', 'for', 'the', 'Online', 'Services', 'Division', 'and', 'vice', '-', 'president', 'of', 'the', 'Microsoft', 'Business', 'Division.[27', ']'] Output Confidence: 0.9894896 ; Subject: the Microsoft Business Division.[27] ; Object: Nadella ; Adding to set of extracted relations ========== Processed 25 / 69 sentences Processed 30 / 69 sentences Processed 35 / 69 sentences === Extracted Relation === Input tokens: ['Microsoft', 'acquired', 'GitHub', 'for', 'US$', '7.5', 'billion.[53', ']', 'As', 'of', 'November', '2023', ',', 'Microsoft', 'stock', 'had', 'increased', 'nearly', 'tenfold', 'since', 'Nadella', 'became', 'CEO', 'in', '2014', ','] Output Confidence: 0.9933629 ; Subject: Microsoft ; Object: Nadella ; Adding to set of extracted relations ========== === Extracted Relation === Input tokens: ['Microsoft', 'acquired', 'GitHub', 'for', 'US$', '7.5', 'billion.[53', ']', 'As', 'of', 'November', '2023', ',', 'Microsoft', 'stock', 'had', 'increased', 'nearly', 'tenfold', 'since', 'Nadella', 'became', 'CEO', 'in', '2014', ','] Output Confidence: 0.99282503 ; Subject: GitHub ; Object: Nadella ; Adding to set of extracted relations ========== === Extracted Relation === Input tokens: ['Microsoft', 'stock', 'had', 'increased', 'nearly', 'tenfold', 'since', 'Nadella', 'became', 'CEO', 'in', '2014', ','] Output Confidence: 0.99049056 ; Subject: Microsoft ; Object: Nadella ; Duplicate with lower confidence than existing record. Ignoring this. ========== Processed 40 / 69 sentences Processed 45 / 69 sentences Processed 50 / 69 sentences Processed 55 / 69 sentences === Extracted Relation === Input tokens: ['"', 'Microsoft', 'CEO', 'Satya', 'Nadella', 'Once', 'Gave', 'Up', 'His', 'Green', 'Card', 'For', 'Love', '"'] Output Confidence: 0.8068665 ; Subject: Microsoft ; Object: Satya Nadella ; Duplicate with lower confidence than existing record. Ignoring this. ========== Processed 60 / 69 sentences Processed 65 / 69 sentences Extracted annotations for 6 out of total 69 sentences Relations extracted from this website: 12 (Overall: 15) URL ( 3 / 10): https://news.microsoft.com/exec/satya-nadella/ Fetching text from url ... Webpage length (num characters): 35 Annotating the webpage using spacy... Extracted 1 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Extracted annotations for 0 out of total 1 sentences Relations extracted from this website: 0 (Overall: 0) URL ( 4 / 10): https://www.linkedin.com/in/satyanadella Fetching text from url ... Webpage length (num characters): 1 Annotating the webpage using spacy... Extracted 1 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Extracted annotations for 0 out of total 1 sentences Relations extracted from this website: 0 (Overall: 0) URL ( 5 / 10): https://www.youtube.com/watch?v=4GLSzuYXh6w Fetching text from url ... Webpage length (num characters): 215 Annotating the webpage using spacy... Extracted 1 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Extracted annotations for 0 out of total 1 sentences Relations extracted from this website: 0 (Overall: 0) URL ( 6 / 10): https://x.com/satyanadella?lang=en Fetching text from url ... Webpage length (num characters): 251 Annotating the webpage using spacy... Extracted 5 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Processed 5 / 5 sentences Extracted annotations for 0 out of total 5 sentences Relations extracted from this website: 0 (Overall: 0) URL ( 7 / 10): https://www.microsoft.com/en-us/microsoft-365/blog/2020/04/30/2-years-digital-transformation-2-months/ Fetching text from url ... Webpage length (num characters): 2103 Annotating the webpage using spacy... Extracted 5 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Processed 5 / 5 sentences Extracted annotations for 0 out of total 5 sentences Relations extracted from this website: 0 (Overall: 0) URL ( 8 / 10): https://blogs.microsoft.com/blog/2020/06/23/addressing-racial-injustice/ Fetching text from url ... Webpage length (num characters): 35 Annotating the webpage using spacy... Extracted 1 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Extracted annotations for 0 out of total 1 sentences Relations extracted from this website: 0 (Overall: 0) URL ( 9 / 10): https://x.com/satyanadella/status/1892242895094313420?lang=en Fetching text from url ... Webpage length (num characters): 251 Annotating the webpage using spacy... Extracted 5 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Processed 5 / 5 sentences Extracted annotations for 0 out of total 5 sentences Relations extracted from this website: 0 (Overall: 0) URL ( 10 / 10): https://blogs.microsoft.com/blog/2023/01/18/subject-focusing-on-our-short-and-long-term-opportunity/ Fetching text from url ... Webpage length (num characters): 35 Annotating the webpage using spacy... Extracted 1 sentences. Processing each sentence one by one to check for presence of right pair of named entity types; if so, will run the second pipeline ... Extracted annotations for 0 out of total 1 sentences Relations extracted from this website: 0 (Overall: 0) ================== ALL RELATIONS for org:top_members/employees ( 16 ) ================= Confidence: 0.9933629 | Subject: Microsoft | Object: Nadella Confidence: 0.99282503 | Subject: GitHub | Object: Nadella Confidence: 0.9918389 | Subject: Microsoft | Object: Satya Nadella Confidence: 0.9909373 | Subject: Microsoft | Object: Steve Ballmer Confidence: 0.9902232 | Subject: Microsoft | Object: John W. Thompson Confidence: 0.9899277 | Subject: the Gates Foundation | Object: Gates Confidence: 0.9894896 | Subject: the Microsoft Business Division.[27] | Object: Nadella Confidence: 0.9889404 | Subject: the Online Services Division | Object: Nadella Confidence: 0.98870194 | Subject: Indian Administrative Service | Object: Bukkapuram Nadella Yugandhar Confidence: 0.98368984 | Subject: Melbourne Accelerator Program | Object: Heath Evans1.4K Followers·3.3K FollowingMarketing & Communications Confidence: 0.97394663 | Subject: MicrosoftSpouse | Object: Satya Narayana Nadella Confidence: 0.96966493 | Subject: MicrosoftSpouse | Object: active1992 Confidence: 0.9689595 | Subject: French Gates | Object: Gates Confidence: 0.96603364 | Subject: Sun Microsystems | Object: Nadella Confidence: 0.78589326 | Subject: Gates Ventures Technology | Object: Melinda Confidence: 0.76217777 | Subject: Microsoft | Object: Gates Total # of iterations = 2