Different between boost, bagging , xgboost , random forest in ML
Boosting and bagging are both ensemble learning techniques in machine learning that aim to improve the performance of individual models by combining the predictions of multiple base models. They work…
Entropy and cross-entropy
why entropy is defined as H(X)=−∑p(x)log(p(x)) Entropy, as defined by the formula H(X) = -∑p(x)log(p(x)), might seem complex at first, but it has a deep and intuitive connection to information…
Working Backwards ( insights, stories and secrets from inside Amazon ) notes
Working Backwards ( insights, stories and secrets from inside Amazon ) by: Colin bryar and bill carr. https://www.amazon.com/Working-Backwards-Insights-Stories-Secrets/dp/1250267595 It provided very good insights how amazon works (1) Bar Raiser…
The leader habit
The book “The leader habit” provide some good insight how to become a leader. Some good points: Delegate well : person’s skill, his interests, and identify what needs to be…
google SRE book notes
google SRE book notes: https://sre.google/sre-book/table-of-contents/ Risk measure aggregate availabilty = successful requests/total requests ( instead of uptime/downtime) release: Branching All code is checked into the main branch of the source…
kafka msg format, how to publish, read
How to Publish a kafka msg Kafka from programmer point of view is: just topic, key, value , headers https://kafka-python.readthedocs.io/en/master/apidoc/KafkaProducer.html send(topic, value=None, key=None, headers=None, partition=None, timestamp_ms=None)[source] Publish a message to a topic. Parameters: topic (str) –…
ML workflow and pipeline orchestration
Kale – Aims at simplifying the Data Science experience of deploying Kubeflow Pipelines workflows. Flyte – Easy to create concurrent, scalable, and maintainable workflows for machine learning. MLRun – Generic mechanism for data…
How to create online courses
https://zapier.com/blog/online-course-platforms/ https://www.thinkific.com/ Youtube?
responsible AI
General intro https://github.com/alexandrainst/responsible-ai https://ai.google/responsibilities/responsible-ai-practices/ https://www.tensorflow.org/responsible_ai Open source implementation https://github.com/microsoft/responsible-ai-toolbox https://www.tensorflow.org/responsible_ai/api_docs https://opendatascience.com/15-open-source-responsible-ai-toolkits-and-projects-to-use-today/ Responsible AI Toolkits for AI Ethics & Privacy TensorFlow Privacy TensorFlow Privacy is a Python library that includes implementations…