Bioinformatics project from the start – Drug discovery part 2 (exploratory data analysis)

Bioinformatics project from the start – Drug discovery part 2 (exploratory data analysis)

HomeData ProfessorBioinformatics project from the start – Drug discovery part 2 (exploratory data analysis)
Bioinformatics project from the start – Drug discovery part 2 (exploratory data analysis)
ChannelPublish DateThumbnail & View CountDownload Video
Channel AvatarPublish Date not found Thumbnail
0 Views
This video represents part 2 in a multi-part video series about the Bioinformatics Project from scratch. In this video I show how to use the dataset from part 1 and use the SMILES notation (which represents the unique chemical structure of compounds) to calculate molecular descriptors. The descriptors we will calculate are Lipinski's descriptors (molecular weight, LogP, number of hydrogen bond donors and number of hydrogen bond acceptors). Finally, we will perform exploratory data analyzes by creating simple box plots and scatter plots to distinguish differences between the active and inactive sets of compounds.

Buy me a coffee: https://www.buymeacoffee.com/dataprofessor

Summary of Part 1: I showed you how to collect an original dataset in biology that you can use in your Data Science Project. In particular, I demonstrated how to download and preprocess the biological activity data from the ChEMBL database. The dataset consists of compounds (molecules) that have been biologically tested for their activity against the intended target organism/protein.

Code:
Part 1 (concise) Code: https://github.com/dataprofessor/code/blob/master/python/CDD_ML_Part_1_Bioactivity_Data_Concised.ipynb
Part 2 code: https://github.com/dataprofessor/code/blob/master/python/CDD_ML_Part_2_Exploratory_Data_Analysis.ipynb

Playlist:
Check out our other videos in the following playlists.
Data Science 101: https://bit.ly/dataprofessor-ds101
Data Science YouTuber Podcast: https://bit.ly/datascience-youtuber-podcast
Virtual Data Science Internship: https://bit.ly/dataprofessor-internship
Bioinformatics: http://bit.ly/dataprofessor-bioinformatics
Data Science Toolbox: https://bit.ly/dataprofessor-datasciencetoolbox
Streamlit (web app in Python): https://bit.ly/dataprofessor-streamlit
Shiny (web app in R): https://bit.ly/dataprofessor-shiny
Google Colab tips and tricks: https://bit.ly/dataprofessor-google-colab
Pandas Tips and Tricks: https://bit.ly/dataprofessor-pandas
Python Data Science Project: https://bit.ly/dataprofessor-python-ds
R Data Science Project: https://bit.ly/dataprofessor-r-ds

Subscribe:
If you're new here, it would mean a lot to me if you would consider subscribing to this channel.
Subscribe: https://www.youtube.com/dataprofessor?sub_confirmation1

Recommended tools:
Kite is a FREE AI-powered coding assistant that helps you code faster and smarter. The Kite plugin integrates with all top editors and IDEs to give you smart additions and documentation as you type. I use Kite and I love it!
View Kite: https://www.kite.com/get-kite/?utm_mediumreferral&utm_sourceyoutube&utm_campaigndataprofessor&utm_contentdescription-only

Recommended books:
Practical machine learning with Scikit-Learn: https://amzn.to/3hTKuTt
Data science from scratch: https://amzn.to/3fO0JiZ
Python Data Science Handbook: https://amzn.to/37Tvf8n
R for Data Science: https://amzn.to/2YCPcgW
Artificial Intelligence: The Insights You Need from the Harvard Business Review: https://amzn.to/33jTdcv
AI Superpowers: China, Silicon Valley and the New World Order: https://amzn.to/3nghGrd

Stock photos, images and videos used on this channel:
https://1.envato.market/c/2346717/628379/4662

Follow us:
Medium: http://bit.ly/chanin-medium
FaceBook: http://facebook.com/dataprofessor/
Website: http://dataprofessor.org/ (under construction)
Twitter: https://twitter.com/thedataprof/
Instagram: https://www.instagram.com/data.professor/
LinkedIn: https://www.linkedin.com/in/chanin-nantasenamat/
GitHub 1: https://github.com/dataprofessor/
GitHub 2: https://github.com/chaninlab/

Disclaimer:
Recommended books and tools are affiliate links that provide me with a share of sales at no cost to me, which will help improve the content of this channel.

#dataprofessor #bioinformatics #drugdiscovery #drugdesign #chembl #cheminformatics #bioinformaticsproject #bioinformaticproject #drug #drugs #molecule #molecules #machinelearning #lecture #dataprofessor #bigdata #QSAR #QSPR #machinelearning #datascienceproject #randomforest #decisiontree #svm #neuralnet #neuralnetwork #supportvectormachine #python #learnpython #pythonprogramming #datascience #datamining #bigdata #datascienceworkshop #dataminingworkshop #dataminingtutorial #datasciencetutorial #ai #artificialintelligence #tutorial #dataanalytics #dataanalysis #machinelearningmodel

Please take the opportunity to connect and share this video with your friends and family if you find it helpful.