Computers and Technology, 18.08.2020 17:01 pineapplepizaaaaa

You do not need to import any libraries or modules about K-means clustering because you will implement it from scratch. The template of the code is provided, and you just need to write your code at specified locations with "your code is here". Download the dataset ‘k_means_clustering_data. csv’ and save it into your working directory where we can find your source code about this homework. The dataset has two columns (‘x’ and ‘y’) and 42 records. They are 42 points in a 2D plane. Your goal is to group them into K clusters using K-means clustering algorithm. The basic step of k-means clustering is simple. Initially, we determine number of cluster K and select K centroid or center of these clusters from the dataset randomly. Then the K-means algorithm will iterate at the following steps until convergence. a. Update each centroid coordinate based on the data points in the cluster b. Measure the distance of each point in the dataset to the K centroids c. Group the point based on minimum distance this is the code provided please fill it in

def k_means_clustering(data, centroids, k):
centroid_current = centroids
centroid_last = pd. DataFrame()
clusters = pd. DataFrame()
data = pd. read_csv('k_means_clustering_data. csv')
data = [(float(x),float(y)) for x, y in data[['x','y']].values]
# iterate until convergence
while not centroid_current. equals(centroid_last):

cluster_count = 0 #it counts the number of clusters. Cluster IDs start from 0.
# calculate the distance of each point to the K centroids
for idx, position in centroid_current. iterrows():
# your code is here. Save the Euclidean distances into 'clusters'

# your code ends
cluster_count += 1

# update cluster, assign the points to clusters
clusterIDs = []
for row_idx in range(len(clusters)):
# your code is here. Check the distances at every row in 'clusters'. Save the assigned cluster IDs to points. The IDs start from 0

# your code ends
# assign points to clusters. The information is saved in the list and assigned to the dataset.
data['Cluster'] = clusterIDs

# store previous cluster
centroid_last = centroid_current

# Update the centroid of each cluster. All information are in 'data'. You have to calculate the new centroids based on the points in the same cluster.
# The centroid is the center of a list of points. For example, (x1, y1), (x2, y2), ..., (xn, yn). The centroid is (x, y), where x = the mean of (x1, x2, ..., xn) and y = the mean of (y1, y2, ..., yn).
centroids =[]
points= [] # save k lists of points in the list. The points in the same list are in the same cluster.
# your code is here. The K centroids will be saved in 'centroids', e. g. [[1, 2], [3, 4], [5, 6]]

# your code ends
centroid_current = pd. DataFrame(data=centroids, columns = ['x', 'y'])

print("No updates on clusters: ", centroid_current. equals(centroid_last))

print("Convergence! Final centroids:", centroid_current)
# plotting
print('Plotting...')
colors= ['b', 'g', 'r', 'c', 'm', 'y', 'k']

# scatter plot all points. All points are colored circles
for i in range(k):
p = np. array(points[i])
x, y = p[:,0], p[:, 1]

plt. scatter(x, y, color = colors[i])
plt. scatter(centroid_current['x'], centroid_current['y'], marker='^', color = colors[i])

# scatter plot all centroids. All points are colored triangles
for j in range(k):
plt. scatter(centroid_current. iloc[j][0], centroid_current. iloc[j][1], marker='^', color= colors[j])

plt. show()

And this is he data provided

x y
1 0
1 1
1 2
2 0
2 1
2 2
2 7
2 9
3 0
3 2
3 4
3 6
3 8
4 4
4 7
4 9
5 5
5 6
5 7
5 8
5 9
5 10
6 2
6 3
6 8
7 0
7 1
7 2
7 4
7 7
7 9
7 10
8 0
8 1
8 2
8 3
8 8
9 0
9 2
9 3
4 2
5 3

Answers: 1

Show answers

Another question on Computers and Technology

Computers and Technology, 21.06.2019 22:00

What do the principles of notice, choice, onward transfer, and access closely apply to? a. privacyb. identificationc. retentiond. classification

Answers: 1

Answer

Computers and Technology, 23.06.2019 12:30

How is the brightness of oled of the diaplay is controled

Answers: 1

Answer

Computers and Technology, 23.06.2019 13:30

Anetwork security application that prevents access between a private and trusted network and other untrusted networks

Answers: 1

Answer

Computers and Technology, 23.06.2019 13:30

Me ! evelyn is a manager in a retail unit. she wants to prepare a report on the projected profit for the next year. which function can she use? a. pmt b. round c. division d. what-if analysis

Answers: 2

Answer

You know the right answer?

You do not need to import any libraries or modules about K-means clustering because you will impleme...

Questions

Advanced Placement (AP), 22.06.2019 13:00

Ang pagsunod sa mga batas trapiko ay isang katangian ng pagiging mabuting mamamayan. ano ang ipinamamalas nito?...

Chemistry, 22.06.2019 13:00

The molality of calcium chloride (cacl2) in an aqueous solution is 2.46 m. what is mole fraction of the solute?...

Chemistry, 22.06.2019 13:00

How many moles of sulfur dioxide are produced when 4.38 moles of oxygen completely react with iron (iv) sulfide...

Mathematics, 22.06.2019 13:00

Tito solved this equation 2(3x+5)=22+5x-3x 6x+10=22+2x 6x+10-2x=22+2x-2x 4x+10=22 4x+10-10=22-10 4x/4=12/4 x=3...

Physics, 22.06.2019 13:00

The substances that are necessary for producing of certain hormones and that store and transport vitamins...

Mathematics, 22.06.2019 13:00

Gerard has 30$ to spend on lunch's this month lunch costs him 1.50 each day after gerard buys lunch for 8 days...

Biology, 22.06.2019 13:00

The mixture of sperm and fluids from the seminal vesicles, prostate gland, and cowper's glands is called...

Mathematics, 22.06.2019 13:00

If you lived one mile from work and you walked four miles per hour, how long will it take you to walk one mile?...

Biology, 22.06.2019 13:00

We can be sure that a mole of table sugar and a mole of vitamin c are equal in their 1) mass in daltons. 2) mass in grams. 3) number of molecules. 4)...

English, 22.06.2019 13:00

Read the excerpt from loom and spindle. but in a short time the prejudice against factory labor wore away, and the lowell mills became filled with bl...

Chemistry, 22.06.2019 13:00

Ineed ! i have no clue how to do this problem.

English, 22.06.2019 13:00

What is a good topic to write a essay?...

Social Studies, 22.06.2019 13:00

Which of these is a constitutional duty of the lieutenant governor of the state of georgia...

English, 22.06.2019 13:00

Why it's time to lay the stereotype of the 'teen brain’ to rest 2. part b: which detail from the text best supports the answer to part a? a “brain d...

Mathematics, 22.06.2019 13:00

Arectangle has an area of 24 square centimeters. the width of the rectangle is 3 centimeters. what is the length of the rectangle? a) 8 centimetersb)...

English, 22.06.2019 13:00

Read the excerpt from chapter 34 of the awakening. "fine fellow, that lebrun," said arobin when robert had gone. "i never heard you speak of him." "i...

Biology, 22.06.2019 13:00

Describe the impact of deforestation of both abiotic and biotic factors within the tropical rainforest....

Mathematics, 22.06.2019 13:00

At 6 pm the temperature was -2 fahrenheit by midnight the temperature had dropped 7° what was the temperature at midnight...

Mathematics, 22.06.2019 13:00

On the coordinate plane shown below, points g and i have coordinates (6, 4) and (3,2), respectively. a. design a stratedgy in which the pythagorean th...

English, 22.06.2019 13:00

Why it's time to lay the stereotype of the 'teen brain’ to rest 1. part a: which statement identifies the central idea of the text? a experts have s...

More questions: Computers and Technology Another questions