shreyanid 6db663e7bb
refactor: separate click wrappers from core evaluation functionality (#1981)
### Summary
Click-decorated functions cannot be called properly outside of the
click interface, which makes it difficult to reuse the setup
functionality in `measure_text_edit_distance` or
`measure_element_type_accuracy`. This PR removes the click decoration
from those functions and moves it into separate wrapper functions whose
only job is to execute the command.

### Technical Details
- Changed as suggested in a response to [this StackOverflow
post](https://stackoverflow.com/questions/40091347/call-another-click-command-from-a-click-command)
- The two kinds of function now live in different places: the
click-decorated `_command` functions stay in ingest/evaluate.py, and the
core functions `measure_text_edit_distance` and
`measure_element_type_accuracy` move into the unstructured/metrics/
folder, which is a more logical location for them.
- Initial test added for measure_text_edit_distance
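
The separation described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the two parameters and the function body are placeholders chosen for the example, and only the function names come from the description above.

```python
import click


def measure_text_edit_distance(output_dir: str, source_dir: str) -> str:
    # Core logic as a plain function, so tests and other code can call it directly.
    # Placeholder body and parameters (assumed for illustration only).
    return f"comparing {output_dir} against {source_dir}"


@click.command()
@click.option("--output_dir", type=str, required=True)
@click.option("--source_dir", type=str, required=True)
def measure_text_edit_distance_command(output_dir: str, source_dir: str) -> None:
    # Thin wrapper: click parses the options, then the plain function does the work.
    click.echo(measure_text_edit_distance(output_dir, source_dir))
```

With this layout the core function can be imported and called like any other Python function, while the `_command` wrapper is the only piece that depends on click.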

### Test
`sh ./test_unstructured_ingest/evaluation-metrics.sh text-extraction`
functionality is unchanged.

---------

Co-authored-by: ryannikolaidis <1208590+ryannikolaidis@users.noreply.github.com>
Co-authored-by: shreyanid <shreyanid@users.noreply.github.com>
Co-authored-by: Trevor Bossert <37596773+tabossert@users.noreply.github.com>
2023-11-07 19:54:22 +00:00

Bank Good Credit
Accredited with IABAC™
(International Association of Business Analytics Certifications)
IABAC International Association of
Business Analytics Certification
DataMites™. All Rights Reserved
Objective & Background
Classify credit card customers as good / bad, based on information from internal and external sources.
Data provided
Demographic: Base file with credit card history details. Only one record for every customer.
Account: Contains data for the various loans availed by the customer. Not related to the credit card. Multiple records for every customer.
Enquiries: Enquiries made by customers for different loan purposes. Multiple records for every customer.
Design
Data to be downloaded using SQL queries.
Required information to be extracted from the Account and Enquiry files and converted to one-to-one files (one record per customer).
The columns from the two files should be merged with the Demographic file using a Left Join with customer number as the key column, to create a final file. The final file should contain all the records in the Demographic file, with additional columns/features from the Account and Enquiry files added to it.
Many customers in the Account and Enquiry files will be left out. This is fine, as we do not know their good/bad label for training purposes anyway.
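
The design above can be sketched with an in-memory SQLite database. This is only an illustration under assumptions: the table and column names (customer_no, label, loan_amount, purpose), the sample rows, the chosen aggregates, and the filling of missing values with zero are all invented for the example and do not come from the actual data.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE demographic (customer_no INTEGER PRIMARY KEY, label TEXT);
CREATE TABLE account (customer_no INTEGER, loan_amount REAL);
CREATE TABLE enquiry (customer_no INTEGER, purpose TEXT);
INSERT INTO demographic VALUES (1, 'good'), (2, 'bad');
INSERT INTO account VALUES (1, 100.0), (1, 250.0), (3, 75.0);
INSERT INTO enquiry VALUES (1, 'auto'), (2, 'home'), (2, 'auto'), (3, 'home');
""")

# Aggregate the one-to-many tables to one row per customer, then LEFT JOIN
# them onto demographic so every demographic record is kept.
rows = conn.execute("""
SELECT d.customer_no,
       d.label,
       COALESCE(a.n_loans, 0),
       COALESCE(a.total_loan_amount, 0.0),
       COALESCE(e.n_enquiries, 0)
FROM demographic AS d
LEFT JOIN (SELECT customer_no,
                  COUNT(*) AS n_loans,
                  SUM(loan_amount) AS total_loan_amount
           FROM account
           GROUP BY customer_no) AS a
       ON a.customer_no = d.customer_no
LEFT JOIN (SELECT customer_no, COUNT(*) AS n_enquiries
           FROM enquiry
           GROUP BY customer_no) AS e
       ON e.customer_no = d.customer_no
ORDER BY d.customer_no
""").fetchall()

for row in rows:
    print(row)
```

In this toy data, customer 3 appears only in the Account and Enquiry tables, so the left join drops that customer from the final result, while customer 2, who has no loans, is kept with zero-filled loan features.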
Analysis of Data
Show using Excel File
Explain Coding / outcomes
Show using Jupyter
Thank You