Large Scale Machine Learning and Other Animals

Thanks for sharing that such good information

2019-01-10T00:09:37.672-08:00

Thanks for sharing that such good information

If your edges are not binary (0 or 1) you may have...

2017-10-21T09:42:33.414-07:00

If your edges are not binary (0 or 1) you may have values outside the range.

Hi Danny, I am a little bit confused about the co...

2017-10-15T21:13:49.299-07:00

Hi Danny,

I am a little bit confused about the cosine similarity choice.

According to the command line argument, "FOR itemcf2: --distance=XX, 3 = PEARSON, 4=COSINE, " itemcf2: --distance=4 means cosine similarity, but in the reference page, cosine distance are in itemcf, not in itemcf2.

I tried with itemcf2, --distance=4 command, which give me similarity value greater than 1, I assume the cosine similarity value should between -1 and 1, right?

Could you help me figure out which --distance index should I choose?

Thanks,
Fen

Thank you, very interesting. I don't think the...

2017-05-25T03:12:44.964-07:00

Thank you, very interesting. I don't think the big players are particularly a problem, people looking for pipeline.io-style solutions are trying to avoid the big player lock trap.

Chris mentions a list with 20 names: do you know what some names on that list might be, besides those you mention?

i am getting this error when i try to configure......

2017-02-10T23:47:15.556-08:00

i am getting this error when i try to configure...

configure: error: cannot find any BLAS library, which is required by LAPACK.
You can override this error by using "--without-blas" option, but the
functionality of the IT++ library will be limited. You have been warned!
please help

Thanks for the mention, Danny! Love your work. H...

2017-01-26T09:40:42.293-08:00

Thanks for the mention, Danny! Love your work.

Here's an updated video, btw: https://youtu.be/swiPWUxBvSc

Here's the jupyter notebook that powers the entire demo: https://github.com/fluxcapacitor/pipeline/blob/master/jupyterhub.ml/notebooks/Conferences/StartupML/Jan-20-2017/SparkMLTensorflowAI-HybridCloud-ContinuousDeployment.ipynb

Thanks again!

I recommend trying out GraphLab Create: http://gra...

2014-07-26T11:44:23.042-07:00

I recommend trying out GraphLab Create: http://graphlab.com/products/create/overview.html it will be easier to set this contest with Graphlab Create.

The "\" tells the linux shell to ignore the special meaning of ":" the filename should be something:info

Hi Danny, very impressive work! But when I try to...

2014-07-26T09:44:55.056-07:00

Hi Danny,
very impressive work!
But when I try to create the file yelp_training_set_review.csv\:info, neither windows nor linux allow me to create a file with the symbol \ and :.
What should I do?

Ben

Great find! I just merged your pull request. Much ...

2014-07-20T07:54:39.386-07:00

Great find! I just merged your pull request. Much appreciated!

Hi Danny, I looked into gensgd.cpp to find out the...

2014-07-20T04:32:48.935-07:00

Hi Danny, I looked into gensgd.cpp to find out the difference of RMSE. It turned out that step3 gets gensgd_rate multiplied 2 times instead of 1 for a step. Now it works. This seems to date from 2 commits made on oct 4 and 10 in 2013. I made a pull request. Regards, Xavier

p.s. I will be happy to setup up a phone call to d...

2014-07-19T15:24:08.706-07:00

p.s.
I will be happy to setup up a phone call to discuss your problem and give some advice regarding Graphlab Create evaluation.

Our project has open source foundations and you ca...

2014-07-19T15:22:36.768-07:00

Our project has open source foundations and you can always stick to the open source if you like. GraphLab Create, while not open source, is still free in the foreseeable future. Fine tuning the open source directly is more difficult. I am now traveling, I will be happy to take a look at the example in a few days - if you don't mind please post a question at our user forum: http://forum.graphlab.com so I could keep track of the issue and not forget.

Hi Danny, Thanks for your feedback. GraphLab Creat...

2014-07-19T15:02:07.847-07:00

Hi Danny,
Thanks for your feedback. GraphLab Create seems great but seems risky to me: I went into terms & conditions and read "We grant you a limited, revocable license". I am currently testing different solutions and it seems hard to know what is the future of such an option considering t&c.

Hi Xavier, We have re-implmentated this code as p...

2014-07-19T13:44:07.769-07:00

Hi Xavier,
We have re-implmentated this code as part of GraphLab Create. You are highly encouraged to try it out - it is free and it gets to much better results. Send me an email and I will send you the ipython notebook to reproduce the exact same experiment in GLC.

Hello, Great thanks for this post. I was able to r...

2014-07-19T09:29:32.273-07:00

Hello, Great thanks for this post. I was able to run all of the different samples but I get an RMSE far higher than expected even after many iterations.

For the exemple which should lead to 2 minutes RMSE, I get an RMSE of 32 minutes after 19 iterations.

I run an Ubuntu, could it be a library issue or setup ?

Thanks

YES.

2014-07-11T07:35:43.719-07:00

YES.

Hi, will the talks from this conference be availab...

2014-07-03T08:51:24.860-07:00

Hi, will the talks from this conference be available online?

is it possible to use GraphLab in java? is it poss...

2014-07-03T02:47:17.056-07:00

is it possible to use GraphLab in java?
is it possible to use graphlab for web classification?

Thanks. In order to low RMSE for the test sets, I ...

2014-07-02T07:38:32.375-07:00

Thanks. In order to low RMSE for the test sets, I am tuning rbm_alpha, rbm_beta, D (not sure if it is necessary to tune D). Is there any other parameters I need to tune? Thanks.

D is always the latent feature vector width (as in...

2014-07-01T22:21:20.699-07:00

D is always the latent feature vector width (as in all methods).
multiplicative step decrement is how much you decrease the SGD step size. The default is 0.9, namely you multiply by 0.9 the step size after each iteration.

Dear Danny, About rbm, what is the parameter D? ho...

2014-07-01T12:21:07.155-07:00

Dear Danny,
About rbm, what is the parameter D? how to give a value to D? what does the rbm_mult_step_dec mean? Thanks!

2014-07-01T08:53:11.452-07:00

This comment has been removed by the author.

We would love a code contribution!

2014-06-30T21:31:55.141-07:00

We would love a code contribution!

BigCLAM is indeed quite interesting, I am in the p...

2014-06-30T14:19:14.096-07:00

BigCLAM is indeed quite interesting, I am in the process of implementing it in GraphChi right now by building upon your SGD code.

Hi, In graphchi we have item based methods. In Gr...

2014-06-07T09:59:11.957-07:00

Hi,
In graphchi we have item based methods. In GraphLab Create we are working on k-NN methods. Once it is ready we will announce on our website.