Show HN: NLP Flashcards for Most of the Internet https://ift.tt/3zqcWop

Show HN: NLP Flashcards for Most of the Internet Hello HN! We're Sam and Kanyes. We're building an extension to help you remember what you read online. We're calling it Ferret [1]. When you open Ferret on an HTML page, it generates recall-based questions + answers to reinforce key concepts with NLP. Consider the following toy example where we open Ferret on an explanation of Bayesian statistics. [2] Q: What does the frequentist interpretation view probability as? A: the limit of the relative frequency of an event after many trials Q: What is often computed in Bayesian statistics using mathematical optimization methods? A:The maximum a posteriori We do this by (1) Parsing the DOM tree of an HTML page for

tags on the client, and segmenting these into preprocessed chunks (2) Performing inference on question-generation with a T5-base model pretrained on SQuAD (3) Extractive question-answering with the chunk & question we've generated with RoBERTa, also pretrained on SQuAD. No GPT-3 here— where's the fun in an API call when you can do it yourself. Ferret is built as a React.JS app deployed as a chrome extension, with models hosted on AWS Sagemaker. Finally, why could this be helpful? Human memory is lossy. Psychologists have shown for forever that your memory can be modeled with a forgetting curve. If you don't attempt to retain knowledge, you'll likely lose it. But most of the content we read online (technical blog posts, documentation, course notes, articles) gets ingested and quickly forgotten. We're interested in low-friction approaches to helping people better remember this content , starting with fellow engineers who depend on their ability to remember key concepts to do the best job. We've open-sourced the full repo and are actively responding to PRs + issues. [3]. You can read more about the technical + product challenges we faced if that interests you as well. [4] We appreciate all feedback and suggestions! [1]https://ift.tt/3n02UqT [2] https://ift.tt/3zAr5iU [3] https://ift.tt/2WFb0K0 [4] https://ift.tt/38wYuyV August 31, 2021 at 03:00AM

Komentar

Postingan populer dari blog ini

Show HN: Interactive exercises for GNU grep, sed and awk https://ift.tt/OxeFwah

Show HN: Create demos & guides just with a simple prompt https://ift.tt/HfWo3mz