Assignment 3

Task 1 will contribute 20% to the total grade of the assignment and task 2 will contribute 80%.

Task 2 is, roughly, twice the length of the previous individual coding tasks. Make sure you allocate enough time to work through it.

Submission instructions

This assignment is due by 11:59 pm on Saturday, November 9. All tasks for this assignment should be submitted via Gradescope. Make sure you double-check your submission to ensure it satisfies all the items in these checklists:

  • File formatting and uploading:
      • hwk3-task2-oil-spills-YOURLASTNAME.ipynb and
  • Notebook content checklists:

Resubmissions after the due date due to not satisfying one of the checks above will be strictly held to the course’s 50%-regrade resubmission policy (see syllabus).

If you have any questions about assignment logistics, please reach out to the TA or instructor by 5 pm Friday, November 8.

Task 1: FAIR and CARE principles reading

In this task, you’ll explore two key frameworks in data governance. The CARE principles [1] guide us in thinking ethically about Indigenous data, focusing on respect, benefit, and the right of communities to control their own data. The FAIR principles [2], on the other hand, encourage us to make data easy to find, use, and share, emphasizing machine-actionability.

Read the following resources and write a one-paragraph (between 100 and 150 words) reflection about them:

Review the rubric for this assignment here. Answer at least one of the following questions for your reflection:

  1. The FAIR principles promote openness and accessibility, while the CARE principles emphasize control and ethics in Indigenous data. Can you think of an example where balancing openness with ethical considerations was or would have been important?

  2. The CARE principles stress the authority of communities over their own data. In your own experience, who has typically held authority over data? How might shifting that authority impact the outcomes or perceptions of data projects you’ve been involved in?

  3. How do the CARE principles challenge or expand your understanding of data management? Have you encountered situations —personally or professionally— where data governance might have benefited from a more community-centered approach?

Ready to submit your answer? Make sure your submission follows the checklist at the top of the assginment!

Setup for task 2

  1. Fork this repository: https://github.com/MEDS-eds-220/eds220-hwk3

  2. You may also work locally using the eds-220 environment or in the workbench-1 server using the EDS-220 kernel.

  3. Using the terminal, clone your fork of the eds220-hwk3 repository into your eds-220 directory.

Task 2: Visualizing oil spills in NY State

In this task you will use data from the New York State government about spills of petroleumn and other hazardous materials to visualize trends in the spatial distribution of the spills.

Workers from the Miller Environmental Group (MEG) dispose of oil particles from a 500 gallon oil spill near Jones Beach, N.Y., Friday, Nov. 23, 2007.

Follow the instructions in the notebook hwk3-task2-oil-spills.ipynb to complete this task. Review the rubric for this assignment here.

Ready to submit your answers? Make sure your submission follows the checklist at the top of the assginment!

References

[1]
S. Carroll et al., “The CARE principles for indigenous data governance,” Data science journal, vol. 19, 2020.
[2]
M. D. Wilkinson et al., “The FAIR guiding principles for scientific data management and stewardship,” Scientific data, vol. 3, no. 1, pp. 1–9, 2016.
[3]
H. Do-Linh, C. Galaz García, M. B. Jones, and C. Vargas Poulsen, Open Science Synthesis training Week 1. NCEAS Learning Hub & Delta Stewardship Council. 2023. Available: https://learning.nceas.ucsb.edu/2023-06-delta/