Finish implementing confirmation workflow #144

matt-bernhardt · 2024-11-20T19:37:00Z

This finishes implementing the confirmation workflow. Full details are in the commit message, including a somewhat lengthy discussion of some side effects and concerns.

Developer

Ticket(s)

https://mitlibraries.atlassian.net/browse/TCO-101

Accessibility

ANDI or Wave has been run in accordance to our guide and
all issues introduced by these changes have been resolved or opened
as new issues (link to those issues in the Pull Request details above)
There are no accessibility implications to this change

Documentation

Project documentation has been updated, and yard output previewed
No documentation changes are needed

ENV

All new ENV is documented in README.
All new ENV has been added to Heroku Pipeline, Staging and Prod.
ENV has not changed.

Stakeholders

Stakeholder approval has been confirmed
Stakeholder approval is not needed

Dependencies and migrations

NO dependencies are updated

NO migrations are included

Reviewer

Code

I have confirmed that the code works as intended.
Any CodeClimate issues have been fixed or confirmed as
added technical debt.

Documentation

The commit message is clear and follows our guidelines
(not just this pull request message).
The documentation has been updated or is unnecessary.
New dependencies are appropriate or there were no changes.

Testing

There are appropriate tests covering any new functionality.
No additional test coverage is required.

JPrevost · 2024-11-22T13:37:31Z

@matt-bernhardt the i18n check is interesting. I'll start a thread on Slack to discuss approaches. TLDR though: it's not a default enabled cop, but our config auto enables new cops even when they are pending. I think I like that option as it will alert us when a new cop detects something in our codebase, but when it does I think we should discuss whether we want to adopt it rather than just accepting it. This discussion is what I'll move to slack (i.e. do we want to enforce https://docs.rubocop.org/rubocop-rails/cops_rails.html#railsi18nlocaletexts in this repo).

app/controllers/confirmation_controller.rb

JPrevost · 2024-11-22T14:39:43Z

app/controllers/confirmation_controller.rb

+  end
+
+  # The confirmation form lists options for each Category record, and then one extra option to flag the term for
+  # removal. "Flag" is not a category, but a separate boolean field on the Confirmation model.


Did you consider having Flagged for removal be a category or a boolean on a Term itself? (I know that landed in a previous commit and I'm not asking for a change, I'm just curious as something being flagged will be best to be removed from all future queries immediately (to be reviewed by us probably) so putting the flag on Term would make it easy to suppress it from all user facing views via a scope. It could be done also with the flag here, but that requires additional more complex queries any time a Term is being looked up. Not a big deal either way, I'm mostly just curious what was considered when making this decision.

I'm nervous to actually delete flagged queries every because by keeping them but suppressing them allows us to always hide them from users whereas deleting them would just allow them to resurface again if someone were ever to search for that term again. I guess our review of suppressed queries might have some thought into how likely it would be to recur and need to be re-suppressed again. Hmmm.

I remember discussing a few options, but I'm not sure I could find where now. I do remember thinking about whether we needed 3, 4, or 5 categories - and settling on 4 with the addition of Undefined, which humans can use but the knowledge graph does not. A Flagged category didn't seem to make sense, based on the assumption that we would actually be deleting terms. If we don't actually delete terms, then perhaps that might make sense to revisit.

My thought about suppressing versus deleting terms is that we shouldn't store records we shouldn't have - keeping them in the database invites problem if the application leaks data or is breached. If a term gets re-submitted, we'll need to delete it again, but I'm not sure it's worth keeping things around in case a duplicate ever gets submitted (remembering how rare repeated searches are in the first place).

I do remember thinking about a flag on the Term, and decided against that approach because having the flag on the Confirmation preserves information about who flagged it - in case follow-up is needed about why they flagged it.

Yeah this is complicated. There aren't many reasons we'd want to flag/delete a term and if we hit one I'm nervous to not have a way to prevent it from being shown to all future validators, keep it out of any data exports, etc. I understand your concern about wanting to delete data we shouldn't have, but I'm waffly on whether it's better to know we shouldn't have it (by flagging it as suppressed at the Term level) to make sure it's never included in our work versus having it continue to show up again in the future. I can probably be brought around to your view based on the infrequency of repeat terms, but by deleting things -- contrary to how it may feel -- we'll never actually know for certain we are preventing it from being seen from anyone (other than us as system maintainers who have raw db access). Hmm.

matt-bernhardt · 2024-11-25T21:54:20Z

Okay, @JPrevost - I've pushed commits that resolve many of your concerns, and extends the (already-existing, it turns out) exclusion of the i18n cop to include the Confirmation controller. That resolves the issue for CodeClimate.

Remaining is the question of how to handle flagged Terms. My thoughts on that, at the moment, are these:

Nothing in this PR actually implements what we do about flagged terms - so I don't know that we need to decide in this PR whether we're going to delete flagged terms, or build a way to suppress them. Either option will require development work beyond the current scope of this ticket.
Given the current data model for Confirmations, I think the PR as currently structured is the way to implement the rest of the confirmation workflow. The question is whether we want to change the data model or not, which leads me to:
I'm ambivalent about whether to switch to five categories, or to stay with four categories and a boolean. Having five categories would make this PR simpler, which is somewhat compelling. We'd drop the boolean flag from the Confirmation model, and add a fifth seed value - updating tests, controller, and view templates along the way. I do have a pretty strong preference that the updated data model continue to store who flagged the term - which I think we'd get with a fifth category.

What are your thoughts at this point?

JPrevost · 2024-11-26T14:22:31Z

@matt-bernhardt this may be something best discussed synchronously, but I'll try to share my thoughts here.

I think the best place to flag a Term we are concerned about is in the Term model itself.

I don't think using a Category of "Flagged for review" or a boolean on a confirmation provides the affordances we'll want to suppress the Term from being used until we have reviewed and possibly chosen to delete it.

A future scope on Term "without_flagged" (other names might be "displayable" or something that makes it clear what the intent is) could be used consistently throughout the application to prevent anyone from seeing flagged results until such time as we as data maintainers either remove the flag as erroneous or delete the Term.

Using a Category or boolean flag on a Confirmation could also be used to create a scope, but it feels much more complicated to reason about as any time we use a Term the scope would need to look to other tables to determine if it is displayable. As we'll have so few flagged Terms, this seems like it's more work with minimal gain.

With all that said:

I agree with your statement that actually deleting (or not) is out of scope of this work. I still think that means we should assume we'll want to either suppress or delete later and make decisions that feel most conducive to that as if we don't plan on doing anything ever then we might as well not flag at all.
I would be okay with removing the ability to flag from this PR entirely and opening a ticket to plan that feature more thoroughly (including deciding if we will immediately suppress and delete later or something else for flagged Terms)
I would be okay with creating a boolean on Term in this PR
I would be okay with using a Category to flag Terms
I would accept, but anticipate changing in the future, using a boolean on Confirmation. My reluctant acceptance comes down to it being fairly easy to change it later. It still feels like the incorrect data model to me as we aren't flagging a Confirmation, we are flagging a Term... but it will work and when we build out the feature more thoroughly we can change it.

matt-bernhardt · 2024-11-26T22:36:35Z

I can see a solid rationale for adopting a scope on the Term model for "records that have not been flagged" - and that we would soon want to enforce that scope in nearly every interaction where the application asks for a list of Terms. This also feels like it calls for an interface to show which records have been hit by this flag, although the interface doesn't feel like it's in scope of the current ticket.

I'm less clear whether the creation of this scope would be part of this ticket, although the conversion of all existing references to use the scope feels like it would be future work.

What about something like this as a middle ground, at least for the short term but possibly for longer:

Change the data model here to create a Flagged category, and update the Confirmation model to refer only to the category value (dropping the flagged boolean field).
Create a new flagged field on the Term data model, with the future intent to hang the scope and everything else off this field.
Update the create method in this PR to set the Term.flagged field in addition to recording the details of the user's Confirmation submission.

This feels like it would achieve both of our goals:

The user's Confirmation would be preserved, allowing us to follow up with needed about why a flag was thrown. Having the field as part of the user submission also allows us to support Confirmations from multiple users without loss of data.
Simultaneously setting a flagged boolean on the Term model serves the operational needs of the application in a more efficient way, as you note that leveraging this scope will be much simpler in database terms.

Both the record of the user's activity, and the operation needs of the application, are preserved - at the cost of a slight duplication of information. However, this duplication will come in handy if we ever decide that the flag is not justified, because we can release a term for processing while still having some breadcrumb that it was once flagged (just in case that's ever a question).

I'd be comfortable with this as a route forward, and am willing to refactor the current PR toward this goal. There would then be future tickets to build the scope, leverage it across the application, and build an interface to show admins (only) what terms have been flagged.

JPrevost · 2024-11-27T13:00:36Z

@matt-bernhardt Yup. That makes a ton of sense to have both the Category and Term boolean for the reasons you stated. Thanks for talking this through to help figure out what made sense to build towards future features versus what the future features are that we can ticket but not implement at this time.

matt-bernhardt · 2024-12-04T16:17:23Z

I'm thinking that rebasing this branch past the recent merges will be fine, as long as I keep the current three commits unchanged - but I want to confirm this before force-pushing.

JPrevost · 2024-12-04T18:25:11Z

@matt-bernhardt yes that will be fine. Leave the commits, but rebase as need will work for me

** Why are these changes being introduced: A previous PR started the implementation of our confirmation workflow, but did not include a form in the view template, nor the controller logic to receive the form submissions. ** Relevant ticket(s): * https://mitlibraries.atlassian.net/browse/tco-101 ** How does this address that need: This finishes the job of the previous PR, completing the confirmation workflow. More specifically, it: * Updates the routes information, moving the "index" display to a better route name, and defining Confirmations as resources under Terms. This makes dealing with routing more logical, and takes advantage of Rails' magic. Only the new and create methods are implemented. * Moves the confirmation form out of the Terms controller, and into a new Confirmation controller, which is where records like this should be managed, in the Confirmation#new method. * Form submissions are received in Confirmation#create, which is helped by two new private methods for dealing with category values (which are either a category_id or a "flag" boolean), and for user feedback based on the create result. * The site_nav and tests are updated to reflect the new controller and route information * The view templates are updated, most notably the new confirmation template, which now has a working form using Rails' built-in form tool. ** Document any side effects to this change: The confirmation form has some inline styles to quickly get the UI into something not-awful. Rubocop is complaining about lack of translation in the feedback messages in the Confirmation controller, The final else clause in the Confirmation#feedback_for method is not something I can figure out a test for - but I'm loathe to not have a default clause in that conditional. The .save method can return false, but usually that means an error message, which I've provided a catch for the most likely scenario (and a test showing how it gets produced). I'm not sure about one of the confirmation controller tests - the one confirming that terms disappear from the list after a confirmation is created. We already have a test for the scope that provides this list, but it felt appropriate to try and test at the controller level as well? I don't remember writing tests like this in the past, though - so maybe it isn't relevant here.

* Method comments now start with the method name, not "This method" * feedback_for comment rewritten to avoid first-person statement * Sentry messages generated in two relevant blocks (rescue method and the else block of a conditional)

The biggest change here is that we are changing the data model, moving the flag field from the Confirmation record to the Term record. This clears the way for Confirmation to treat Category as a required field, which explains the three migrations in this commit. Because "flagged" is now a category in Confirmation terms, we add that to the seeds file. Along with the data model change, we update the new confirmation form to more explicitly label fields as ID fields, in addition to no longer having a separate list entry for flagging terms. Through all of the above, the controller shifts a bit, as set_category is no longer needed to manage received values. In its place, there is now a flag_term method for setting the new flag on the related term - as long as the confirmation saves correctly. There is also a guard clause for when the confirmation does not save correctly, which means the feedback_for method gets a little simpler. --- Next up is a slight refator to adopt a strong parameters approach, which will have some additional changes with it.

This changes the confirmation controller to adopt a strong parameters posture, which includes explicitly adding the term ID and user ID fields to the submitted form (rather than handling them implicitly from other sources). During testing we also abandon the guard clause and .create methods in the controller, in favor of .new and a more explicit if/else block on the .save method. During testing, this proved necessary to confirm that the submitted record is actually saved, as the single .create step I was trying failed silently with no feedback. The "submitting a confirmation form without all fields shows an error" test is a reflection of this difficulty, confirming that the application does not quietly swallow anything if a form goes awry. Ditto with the assertions that look for confirmation counts pre- and post- action.

JPrevost

Looks good. Two minor suggested documentation updates (one to remove/move a line that is no longer accurate should definitely be fixed, the other is a suggestion.

JPrevost · 2024-12-10T20:07:03Z

app/controllers/confirmation_controller.rb

+
+  # feedback_for takes the result of the confirmation.save directive above and sets an appropriate flash message.
+  #
+  # The final else clause is likely to be difficult to provoke, so we are sending a Sentry message in that block in


I think this line of documentation should be removed or moved as it does not reflect the current state of the method it is documenting.

Good catch - I've rewritten this entire comment block. Thanks!

JPrevost · 2024-12-10T20:09:34Z

app/controllers/confirmation_controller.rb

+    t.save
+  end
+
+  # confirmation_flag? compares the submitted category (coerced to an integer) to the ID value for the "flagged" category. We


We do this at least twice in this controller. makes me nervous as it would be easy to miss updating this type of statement when the logic in the controller changes.

JPrevost · 2024-12-10T20:18:09Z

app/controllers/confirmation_controller.rb

+  # create receives the submission from the new confirmation form, creating the needed record with the help of various
+  # private methods.
+  def create
+    new_record = Confirmation.new(confirmation_params)


Strong params cleaned this up nicely. Thanks for going through that process.

When I saw what the final diff ended up being, I agree that it makes things much simpler. I'm not completely happy that it doesn't seem to be possible to chain .require() statements in the way I originally expected, but this ended up being a bit better than I at first feared.

mitlib temporarily deployed to tacos-api-pipeline-pr-144 November 20, 2024 19:38 Inactive

matt-bernhardt force-pushed the tco-101 branch from 05c193d to 26d113c Compare November 21, 2024 16:06

mitlib temporarily deployed to tacos-api-pipeline-pr-144 November 21, 2024 16:06 Inactive

mitlib temporarily deployed to tacos-api-pipeline-pr-144 November 21, 2024 16:13 Inactive

matt-bernhardt force-pushed the tco-101 branch from 6ec0f0b to 9266ee5 Compare November 21, 2024 16:27

mitlib temporarily deployed to tacos-api-pipeline-pr-144 November 21, 2024 16:27 Inactive

matt-bernhardt force-pushed the tco-101 branch from 9266ee5 to 7e9b658 Compare November 21, 2024 21:57

mitlib temporarily deployed to tacos-api-pipeline-pr-144 November 21, 2024 21:57 Inactive

matt-bernhardt marked this pull request as ready for review November 21, 2024 21:59

matt-bernhardt requested review from JPrevost and jazairi November 21, 2024 21:59

JPrevost self-assigned this Nov 22, 2024

JPrevost requested changes Nov 22, 2024

View reviewed changes

matt-bernhardt force-pushed the tco-101 branch from 7e9b658 to bd39c4f Compare November 25, 2024 20:43

mitlib temporarily deployed to tacos-api-pipeline-pr-144 November 25, 2024 20:43 Inactive

mitlib temporarily deployed to tacos-api-pipeline-pr-144 November 25, 2024 21:28 Inactive

matt-bernhardt force-pushed the tco-101 branch from 5614225 to 5dd9679 Compare December 4, 2024 18:52

matt-bernhardt added 5 commits December 6, 2024 14:00

Respond to code review feedback

aeeebcd

* Method comments now start with the method name, not "This method" * feedback_for comment rewritten to avoid first-person statement * Sentry messages generated in two relevant blocks (rescue method and the else block of a conditional)

ignore Rails/I18nLocaleTexts cop

a22027f

matt-bernhardt force-pushed the tco-101 branch from 5dd9679 to 5e44ac6 Compare December 9, 2024 16:23

JPrevost self-requested a review December 10, 2024 20:22

JPrevost approved these changes Dec 10, 2024

View reviewed changes

Update method documentation from code review

07a9189

matt-bernhardt merged commit 8e4fefe into main Dec 11, 2024
5 of 6 checks passed

matt-bernhardt deleted the tco-101 branch December 11, 2024 15:17

Finish implementing confirmation workflow #144

Finish implementing confirmation workflow #144

Uh oh!

Conversation

matt-bernhardt commented Nov 20, 2024 • edited by JPrevost Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Developer

Ticket(s)

Accessibility

Documentation

ENV

Stakeholders

Dependencies and migrations

Reviewer

Code

Documentation

Testing

Uh oh!

JPrevost commented Nov 22, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

matt-bernhardt commented Nov 25, 2024

Uh oh!

JPrevost commented Nov 26, 2024

Uh oh!

matt-bernhardt commented Nov 26, 2024

Uh oh!

JPrevost commented Nov 27, 2024

Uh oh!

matt-bernhardt commented Dec 4, 2024

Uh oh!

JPrevost commented Dec 4, 2024

Uh oh!

JPrevost left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

matt-bernhardt commented Nov 20, 2024 •

edited by JPrevost

Loading