Skip to content

πŸ”¬πŸš¨πŸ“ Sci-Guard is an AI tool that scans research papers for banned words and suggests revisions. Consider it your second (government) reviewer.

Notifications You must be signed in to change notification settings

thatcatcancode/sci-guard

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

72 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Sci-Guard πŸ”¬πŸš¨πŸ“

See it in action -> https://sciguard.netlify.app/

What

Sci-Guard is an AI tool that scans research papers for banned words and suggests replacements. Consider it your second (government) reviewer.

Why

Protects important scientific research from being mistakenly overlooked and unfunded due to banned words. Saving science one mistakenly-banned word at a time.

Banned Words

There is no single source of truth where an official list is hosted for public use. As such, this list may be incomplete or over-reaching.

Words also sourced from NYT report, said to be specific to NIH grant funding.

Words were also sourced from pen.org

It is often subwords that are discovered problematic and therefore this project aims to replace those occurrences as well.

For example: trans in transcriptomics might be flagged with a suggested revision of gene expression analysis.

Workflow

  • Scientists upload research grant proposal (.txt, .docx, or .pdf).

  • Scans the text for banned words or phrases.

  • Returns a summarized report

  • Returns the problematic sentences with the banned words highlighted.

  • Suggests replacement sentences, using generative AI.

πŸ–ΌοΈ Frontend (Web UI)

  • React
  • Components: File upload dropzone, Chart.js, Results view (problematic sentence + suggested fix)
  • Cloudfare Turnstile to verify user is not a robot (optional)

🧠 Backend (FastAPI)

Endpoints:

Swagger Docs

  • POST /paper/analyze: handles file upload, returns flagged sentences and analysis of paper
  • POST /sentence/rewrite: rewrites sentence without banned words using LLM, recursive

βš™οΈ NLP/AI Components

Keyword Scanner: ntlk / spaCy / custom dictionary-based matching

Generative AI: OpenAI GPT-4x

In the context of GPT and other large language models (LLMs), a context window refers to the maximum amount of text the model can process at one time when generating a response. It's measured in tokens, which are roughly equivalent to words, and includes both the user's input (prompt) and the model's own generated text

About

πŸ”¬πŸš¨πŸ“ Sci-Guard is an AI tool that scans research papers for banned words and suggests revisions. Consider it your second (government) reviewer.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published