Implementation of DBSCAN model by Kelang-Tian · Pull Request #75 · sql-machine-learning/models

Kelang-Tian · 2020-07-02T12:41:58Z

Implementation of the clustering model.
The test process will be submitted next time.

Yancey0623 · 2020-07-06T08:30:16Z

tests/test_dbscan.py

+iris_target = iris.target
+
+
+def check_model_exist(path):


It seems that this function is not used?

Remove the function.

Yancey0623 · 2020-07-06T10:10:35Z

Makefile


 install-requirements:
-	pip install -U -e .
+	pip install --default-timeout=1000 -U -e .


Why increasing the timeout threshold ?

Timeout occurred while installing the packages because of network instability.

Yancey0623 · 2020-07-06T11:22:55Z

sqlflow_models/dbscan.py

+import pandas as pd
+from sklearn import datasets, metrics
+
+def optimizer():


It seems that it's not a NN model, should we remove this function?

Yes, there is no need to set optimizer.

Yancey0623 · 2020-07-06T11:27:29Z

sqlflow_models/dbscan.py

+
+        return self
+
+    def _split_dataset(self, dataset):


Do we actually need this function?

Replace this function with _read_Dataset_data, which split tf.dataset type data into features and labels(If it exists).

Yancey0623 · 2020-07-06T11:34:14Z

sqlflow_models/dbscan.py

+                cluster_labels[self.clusters[i][j]] = i
+        return cluster_labels
+
+    def fit(self, X):


Instead of overriding fit function, we can implement sqlflow_train_loop to define the custom train loop logic. You may find that sqlflow_train_loop is not the standard API of Keras model, it's just let SQLFlow runtime to know to run the custom train loop.

c.f. https://github.com/sql-machine-learning/sqlflow/blob/9628487783878e9f019b1bf379e192134344e3a3/python/sqlflow_submitter/tensorflow/train_keras.py#L173

Enables sqlflow_train_loop function to handle tf.dataset type of data, because sqlflow server invokes tf.dataset.
https://github.com/sql-machine-learning/sqlflow/blob/483b8676cf93f373d5073d84b0bee311bb122012/python/runtime/tensorflow/input_fn.py#L72

Yancey0623 · 2020-07-06T11:35:15Z

sqlflow_models/dbscan.py

+                  purity_score(y_df, self.labels_))
+'''
+if __name__ == '__main__':
+    from sklearn.datasets.samples_generator import make_blobs


Can we remove the comment code and testing this model in the test_db_scan.py? A referenced test script: https://github.com/sql-machine-learning/models/blob/develop/tests/test_arima_with_stl_decomposition.py

Comment code removed and rename the test_db_scan.py.

Kelang_Tian added 5 commits July 2, 2020 20:30

dbscan model without test

b4908be

dbscan model without test

827238b

dbscan model with test

db6b2d8

dbscan model without test

8b0ef5f

dbscan model without test

cca7b66

Kelang-Tian mentioned this pull request Jul 2, 2020

Add DBSCAN model sql-machine-learning/sqlflow#2572

Open

Kelang_Tian added 2 commits July 4, 2020 16:49

add test with iris

0e84ac0

add func sqlflow_train_loop

3888acc

Yancey0623 reviewed Jul 6, 2020

View reviewed changes

Kelang_Tian added 2 commits July 8, 2020 18:48

add input iris tf.dataset

99ec7fc

rename dbscan test

143c856

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implementation of DBSCAN model#75

Implementation of DBSCAN model#75
Kelang-Tian wants to merge 9 commits intosql-machine-learning:developfrom
Kelang-Tian:model-tkl

Kelang-Tian commented Jul 2, 2020

Uh oh!

Yancey0623 Jul 6, 2020

Uh oh!

Kelang-Tian Jul 8, 2020

Uh oh!

Yancey0623 Jul 6, 2020

Uh oh!

Kelang-Tian Jul 8, 2020

Uh oh!

Yancey0623 Jul 6, 2020

Uh oh!

Kelang-Tian Jul 8, 2020

Uh oh!

Yancey0623 Jul 6, 2020

Uh oh!

Kelang-Tian Jul 8, 2020

Uh oh!

Yancey0623 Jul 6, 2020

Uh oh!

Kelang-Tian Jul 8, 2020

Uh oh!

Yancey0623 Jul 6, 2020

Uh oh!

Kelang-Tian Jul 8, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Kelang-Tian commented Jul 2, 2020

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants