# Project 3: 911 Call Data Mining

## Data Mining

* Association: &#x20;

  Apriori Algorithm
* Supervised learning (classification)

  Decision Tree &#x20;

  Bayesian Classification &#x20;

  SVM-Support Vector Machine  &#x20;

  K-Nearest Neighbor &#x20;

  Random Forest &#x20;

  Two approaches to **avoid overfitting**: prepruning, postpruning
* Unsupervised learning (clustering)

## Challenges:

This is a spatio temporal data mining problem. We have LiDAR data for the Downtown Lincoln, which consists of 734,000 points; 911 services calls data crawled from government sites; NOAA weather data. We also contacted the electricity system and water system. But they said the data are sensitive and cannot disclose.

The big problem is that we have different kinds of data based on location. However, they are not overlapped with each other. If the overlapped part dataset is small, it is highly possible that the result would be overfitted. On the other hand, the datasets are not exactly overlapped.

So we first write a program to clean the data and to find the relatively high overlapped data points. However, the data is so big and our program is crashed on my computer. Then we think whether we can use the computer center to process our data. But for student they only allocate small memory to do this. Finally, we came up an idea whether we can plot the data in different color, and see which area has high density. Then it worked!


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://liuyang89116.gitbook.io/my-leetcode-book/behavior_questions/project_3_911_call_data_mining.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
