Based on the limited information, I think it will be difficult to build a quantitative model.
The reasons are as follows:
1. It is hard to measure the following criteria:
e15, e16, e17, e18
2. It is also hard to get complete e19 information.
3. You may have access to e20 and e21, but the cost may exceed the benefit.
Another issue:
the dependent variable you mentioned looks more like fraud.
If you want to build a model to predict it, there is a well-known model, the fraud triangle.
Its independent variables are:
financial pressure to XXX
opportunity to XXX
rationalization of unprofessional conduct (e.g. XXX).
With only three variables, your workload will be much smaller,
and the data may be easier to collect.
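One possible way to write down such a three-variable model is a logistic regression. This is only a sketch under my own assumptions: the symbols P (pressure), O (opportunity), R (rationalization), the binary outcome y, and the coefficients are illustrative names, not something taken from your model.

```latex
% y_i = 1 if firm i shows the behaviour being predicted, 0 otherwise (assumed coding)
% P_i, O_i, R_i: pressure, opportunity, rationalization scores for firm i (assumed measures)
\Pr(y_i = 1 \mid P_i, O_i, R_i)
  = \frac{1}{1 + \exp\!\bigl(-(\beta_0 + \beta_1 P_i + \beta_2 O_i + \beta_3 R_i)\bigr)}
```

A positive coefficient would mean that a higher score on that variable goes together with a higher predicted probability.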
e15, e16 and e17 mainly describe the CEO's or business owner's personal interest.
You could combine them into one variable.
That would be much simpler.
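If it helps, here is a minimal sketch of one common way to combine several indicators into one: standardize each to a z-score and take the average. Everything here is an assumption for illustration: the function names, the equal weighting, and the numbers, which are made up and are not real data for e15, e16 or e17.

```python
from statistics import mean, pstdev


def zscores(values):
    """Standardize a list of numbers to mean 0 and (population) std 1."""
    m = mean(values)
    s = pstdev(values)
    if s == 0:
        # A constant indicator carries no information; map it to zeros.
        return [0.0 for _ in values]
    return [(v - m) / s for v in values]


def composite_index(*columns):
    """Average the z-scores of several indicators, observation by observation."""
    standardized = [zscores(col) for col in columns]
    return [mean(row) for row in zip(*standardized)]


# Made-up scores for four hypothetical firms (illustration only).
e15 = [1, 2, 3, 4]
e16 = [10, 20, 30, 40]
e17 = [0.1, 0.2, 0.3, 0.4]

# One combined "personal interest" variable instead of three.
personal_interest = composite_index(e15, e16, e17)
```

Equal weights are just the simplest choice. If you think one criterion matters more, a weighted average would work the same way.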
Another issue:
the wording you use is very colorful, e.g. "群众\曝光" (roughly "the masses" and "exposure").
Academic writing prefers a neutral style.
"Whistle-blower" or "public monitoring" would sound much more professional.
Last but not least,
I don't think regulation is a decision variable,
because regulations are there whether you choose to disclose your accident or not.
I am a student too,
and just wanted to discuss your model with you.
I hope it helps.
Good luck.