Skip to content

Latest commit

 

History

History
21 lines (18 loc) · 1.33 KB

File metadata and controls

21 lines (18 loc) · 1.33 KB

LLM Ethics Library

A Library for assessing the ethical capablilites of LLMs through prompting them with ethical dilemmas.

Functions:

  • Generating many variations of prompts
    • to make sure irrelevant factors like "output option ordering" or "dilemma formulation" have no siginficant impact on the results
  • Prompting of LLMs (currently only OpenAI ChatGPT)
  • Provides wrapper classes for Prompts and Responses to make working with them easier
    • Importing and Exporting from/to JSON is supported
  • Previously generated prompts & responses can be found in the data directory

Read before using!

  • The content (prompts & responses) of the different versions are not mutually exclusive. When merging different versions together, make sure to check for duplicates and remove them.

Results

Results of v1.4 (still work in progress!)

It seems like GPT-4o's behaviour aligns mostly with Deontology and Virtue Ethics when looking at the crying baby dilemma. The graph below plots the decision to kill (YES) or spare (NO) the child in the following situation: A community hides from armed invaders. A crying child endangers their secrecy. image