Exposure Graduate Qualifying Project - Fall 2018 Huanhan Liu | - PowerPoint PPT Presentation

https://medium.com/greyatom/introduction-to-natural-language-processing- 78baac3c602b

Understanding Data Data Cleaning Methodology Natural Language Processing Neural Network

Understanding the Data

Understanding the Data • Flag Words – impact, affect, contaminate… • Media – soil, groundwater, indoor air… • Modifier – greater than, less than… • Chemical – CVOCs, gasoline, petroleum, TCE…

Data Cleaning • Compile reports/tech screen scores • Unlock reports • Extract PDF reports to text • Identify/aggregate keyword/flag word sentences • Images/Tables? • Eliminate non-essential numeric characters • Annotate extracted sentences https://www.invensis.net/blog/data-processing/5- advantages-of-data-cleansing/

Natural Language Processing Word To Vector • Term Frequency - Inverse Document Frequency • Skip-Gram / Neighbor words prediction https://primer.ai/blog/Chinese-Word-Vectors/

Neural Network Artificial neural networks are computing systems vaguely inspired by the biological neural networks that constitute animal brains. (https://en.wikipedia.org/wiki/Artificial_neural_network) https://dzone.com/articles/an-introduction-to-the-artificial-neural-network

Neural Network Long Short Term Memory For Example: You remember you eat Long short-term memory model is a recurrent neural network composed of units/cells with an input gate, an lunch, how to eat lunch, what you like output gate and a forget gate. The cell remembers for lunch, what you had for lunch, and values over arbitrary time intervals and the three gates you try new foods for lunch that you regulate the flow of information into and out of the may or may not like cell. (https://en.wikipedia.org/wiki/Long_short-term_memory) Convolutional Neural Network For Example: If you look at a small A Convolutional Neural Network (CNN) is comprised of portion of a picture of a cat you may one or more convolutional layers and then followed by one or more fully connected layers. The architecture of only see fur, as you move your view a CNN is designed to take advantage of the input frame over the cat picture you see more feature local connections and tied weights followed by cat features, cat ears, cat mouth and some form of pooling which results in translation of cat eyes until you finally realize you are invariant features. looking at a picture of a cat. (h ttp://ufldl.stanford.edu/tutorial/supervised/ConvolutionalNeuralNetwo rk/)

PDF Sentence Extraction

Long Short Term Memory http://www.bbc.co.uk/schools/gcsebitesize/science /add_ocr_21c/brain_mind/complexrev3.shtml

Convolutional Neural Network Convolutional Neural Network https://www.ayasdi.com/blog/artificial-intelligence/using-topological-data-analysis-understand-behavior-convolutional-neural-networks/

Convolutional Neural Network Convolutional Neural Network https://www.researchgate.net/figure/Overview-of-the-basic-CNN-architecture-A-Each-word-within-a-discharge-note-is_fig1_323213106

Long Short Term Memory Result Summary 1. Final test accuracy of model - Positive flag prediction accuracy: 70% - Negative flag prediction accuracy: 90% 2. More training steps increase largely on positive flag prediction accuracy, with a trade off of slight decrease on negative accuracy

Convolutional Neural Network Result Summary 1. Final test accuracy of model - Positive flag prediction accuracy: 96% - Negative flag prediction accuracy: 85% 2. Add punishment when model predict negative but the real situation is positive. Model has a better positive accuracy than negative accuracy.

Summary and Conclusion ● NLP with deep learning methods (CNN and LSTM-RNN) provides a feasible solution for flag condition prediction of text based IRA reports. In both the CNN and LSTM model, prediction performance shows promising results on correctly identifying positive flag conditions based on the collected test reports. ● Further data cleaning, more balanced data sampling, and a more comprehensive model will increase the accuracy on flag condition predictions.

Project Mentors ● Mark E. Baldi, Deputy Regional Director, BWSC ● Matthew Fitzpatrick, BWSC Data Management Coordinator Faculty Advisors ● Elke A. Rundensteiner, Data Science Director, WPI ● Fatemeh Emdad, Data Science Professor, WPI ● Chun-Kit Ngan, Data Science Professor, WPI

GQP MassDEP Fall 2018 Team ● Huanhan Liu, MS Data Science, WPI, hliu7@wpi.edu ● Rushikesh Naidu, MS Data Science, WPI, ranaidu@wpi.edu ● Yi Pan, MS Data Science, WPI, ypan@wpi.edu ● Yun Yue, MS Data Science, WPI, yyue@wpi.edu

Exposure Graduate Qualifying Project - Fall 2018 Huanhan Liu | - PowerPoint PPT Presentation

Machine Estimation of Exposure Graduate Qualifying Project - Fall 2018 Huanhan Liu | Rushikesh Naidu | Yi Pan | Yun Yue Mentors: Mark Baldi | Matthew Fitzpatrick Advisors: Fatemah Emdad | Chun Kit Ngan

Exposure Routes Internal and External Exposure External exposure Internal exposure Body surface

Understand what UV-C is Issues of exposure to UV-C Safety Measures to prevent exposure

Back to Basics Exposure and Depth of Field Woodley PC Members Evening 16 September 2019 Bob

2016 Standards Exposure Survey Results Comment Period February 1 to April 30, 2016 Standards

Fetal exposure to EMF during maternal use of Laptops Fetal exposure of the fetus to EMF during

General Principles of and Examples of Environmental Exposure Assessment Kees de Hoogh Andrea

RF Exposure Procedures RF Exposure Procedures TCB Workshop October 2012 Laboratory Division

Taking into account variability and uncertainty in exposure assessment Prise en compte de la

Exposure Chair: Bill Kraus Members: Wayne Campbell, John Jakicic, Kathy Janz, Ken Powell Exposure

Survey results - Section 1 EXPOSURE TO RISKS EXPOSURE Structure 1. Occupation / education 2.

WITH CONTRIBUTION FROM S Strack, P Davis & W Raskob Acute tritium exposure experiments at

C redit V alue A djustment by Expected Future Exposure M ethod M u M . Liu, CQF Agenda

Co- -exposure of Arsenite and exposure of Arsenite and Co Benzo(a)pyrene: Effect of : Effect

Houston Exposure To Air Houston Exposure To Air Toxics Study (HEATS) Toxics Study (HEATS)

Silica Dust: It will take your breath away Enforms Exposure Control Plan (ECP) October 8, 2015

Exposure Draft, Inventories, Section 3031 SLIDE 1 Exposure Draft, Inventories, Section

The Effects of Hypertext Structure, Presentation, and Instruction Types on Perceived Disorientation

Overview of Near Term Policy Agenda Focus on Regulatory Reform Tobias Adrian, Sean Campbell

North Coastal Prevention Youth Coalition Sticker Shock Who We Are Students from El

VCSO Mechanical Shock Compensation Who are we? Team members: Max Madore Joseph Hiltz-Maher

E. Dupoux Ecole des Hautes Etudes en Sciences Sociales Updated: Dec 2010 Phonological

Local quality control of LHC electrical interconnections during the 2012 shutdown 1 LHC Splice

European Union Project FASTGRID Abstract HVDC (High Voltage Direct Current) super-grids are one

Decision on the ISO 2016-2017 Transmission Plan Neil Millar Executive Director, Infrastructure

Exposure Graduate Qualifying Project - Fall 2018 Huanhan Liu | - PowerPoint PPT Presentation

Machine Estimation of Exposure Graduate Qualifying Project - Fall 2018 Huanhan Liu | Rushikesh Naidu | Yi Pan | Yun Yue Mentors: Mark Baldi | Matthew Fitzpatrick Advisors: Fatemah Emdad | Chun Kit Ngan

Exposure Routes Internal and External Exposure External exposure Internal exposure Body surface

Understand what UV-C is Issues of exposure to UV-C Safety Measures to prevent exposure

Back to Basics Exposure and Depth of Field Woodley PC Members Evening 16 September 2019 Bob

2016 Standards Exposure Survey Results Comment Period February 1 to April 30, 2016 Standards

Fetal exposure to EMF during maternal use of Laptops Fetal exposure of the fetus to EMF during

General Principles of and Examples of Environmental Exposure Assessment Kees de Hoogh Andrea

RF Exposure Procedures RF Exposure Procedures TCB Workshop October 2012 Laboratory Division

Taking into account variability and uncertainty in exposure assessment Prise en compte de la

Exposure Chair: Bill Kraus Members: Wayne Campbell, John Jakicic, Kathy Janz, Ken Powell Exposure

Survey results - Section 1 EXPOSURE TO RISKS EXPOSURE Structure 1. Occupation / education 2.

WITH CONTRIBUTION FROM S Strack, P Davis &amp; W Raskob Acute tritium exposure experiments at

C redit V alue A djustment by Expected Future Exposure M ethod M u M . Liu, CQF Agenda

Co- -exposure of Arsenite and exposure of Arsenite and Co Benzo(a)pyrene: Effect of : Effect

Houston Exposure To Air Houston Exposure To Air Toxics Study (HEATS) Toxics Study (HEATS)

Silica Dust: It will take your breath away Enforms Exposure Control Plan (ECP) October 8, 2015

Exposure Draft, Inventories, Section 3031 SLIDE 1 Exposure Draft, Inventories, Section

The Effects of Hypertext Structure, Presentation, and Instruction Types on Perceived Disorientation

Overview of Near Term Policy Agenda Focus on Regulatory Reform Tobias Adrian, Sean Campbell

North Coastal Prevention Youth Coalition Sticker Shock Who We Are Students from El

VCSO Mechanical Shock Compensation Who are we? Team members: Max Madore Joseph Hiltz-Maher

E. Dupoux Ecole des Hautes Etudes en Sciences Sociales Updated: Dec 2010 Phonological

Local quality control of LHC electrical interconnections during the 2012 shutdown 1 LHC Splice

European Union Project FASTGRID Abstract HVDC (High Voltage Direct Current) super-grids are one

Decision on the ISO 2016-2017 Transmission Plan Neil Millar Executive Director, Infrastructure

WITH CONTRIBUTION FROM S Strack, P Davis & W Raskob Acute tritium exposure experiments at