• Hey
    I'm Jai

    Machine Learning Engineer and Data Science

    View Resumé

  • Find out more
    about my research

    Publications in the field of Deep Learning and Biomedicine

    View Research

About Me

Who Am I?

Hey I'm Jai Kotia! I am a Master's in Computer Science student at Johns Hopkins University, expected to graduate in Summer 22. I am currently applying to full-time job positions in Machine Learning, Data Science and Software Engineering.

I have strong interests in automation using technology and am excited about its application in the Finance and Biomedical sectors. I strongly believe that software has the power to bridge gaps in most industries, between an efficient and affordable solution. Be it with the use of sophisticated AI or full-stack apps and websites.

My background is fairly versatile, as I have experiences of over a year in Applied Machine Learning, Data Science, Android App Development and Research in AI (NLP/CV). This versatility stems from the fact that technology is always evolving. I believe it's equally important to master your current skill set and be prepared to learn new skills. This also affirms my decision to pursue a career in this field. After performing research and working with the latest technologies over a significant period, I have a good understanding of my own capabilities and interests. I have found a good balance between working on projects that I like and ensuring I make significant contributions towards it.

I have authored the blog éclairé, where we published exciting articles about AI and concise reviews of technical papers in the field. I have also been the instructor of the course Python in Finance for Yodalearning.

So feel free to reach out if you have any open positions that you feel may interest me!

Machine Learning

Data Science

Software Engineering

Cups of Coffee
Projects
Internships
Publications
Research

Publications

Discovery of adversarial attacks on deep neural networks, have exposed the vulnerabilities of these networks, wherein they often entirely fail to classify the attack generated images. While deep learning networks have generated promising results in performing brain tumor classification, there has been no analysis of their susceptibility to adversarial attacks. Vulnerability to adversarial attacks can render the deep neural networks useless for practical medical application. In this paper, a study has been performed to determine extent of white box adversarial attacks on convolutional neural networks used for brain tumor classification. Three different adversarial attacks are implemented on the network, namely Noise generated, Fast Gradient Sign, and Virtual Adversarial Training methods. The performance of the network under these attacks is analyzed and discussed. Furthermore, in the paper it is shown how these networks perform when trained on the adversarial attack generated images, which could be a possible solution to prevent the failure of the classification networks against adversarial attacks.
View publication
View paper

As cancer treatments are gaining momentum, in a bid to improve drug potency, doctors are looking towards precision cancer medicine. Here the drug prescriptions are tailored to the patients gene changes. In this paper, we aim to automate the task of drug selection, by predicting the clinical outcome of using a particular drug on a combination of the patients gene, variant and cancer disease type. While the main idea behind precision cancer treatment is to identify drugs suitable to each patients unique case, it is justifiable for us to assume that there exists a predictive pattern in these prescriptions. We propose to implement this prediction using three machine learning models, the Support Vector Machine, the Random Forest Classifier and the Deep Neural Network. The models yielded promising results of over 90% accuracy and over 95% ROC-AUC score. This positive outcome affirms our assumption that there exists a predictive pattern in precision treatments, that could be extrapolated to help automate such tasks. We further analyzed the data set and identified latent relations between drug, cancer disease, target gene and gene variant. This exploration uncovered some significant patterns where we can observe how a particular drug has had successful results in treating a particular cancer and targeting specific gene variants.
View publication
View paper

Face Recognition is steadily making its way into commercial products. As such, the accuracy of Face Recognition systems is becoming extremely crucial. In the Firefly Algorithm, the brightness of fireflies is used to measure attraction between a pair of unisex fireflies. The firefly with higher brightness attracts the less bright firefly. The objective function is defined in proportion to the brightness, to define a maximization problem. This chapter aims to present the promising application of the Firefly Algorithm for Face Recognition. The Firefly Algorithm is used in a hyper dimensional feature space to select features that maximize the recognition accuracy. This chapter delineates how the Firefly Algorithm is a suitable algorithm for selection of the features in a Face Recognition model. The Firefly Algorithm is then applied on this feature space to identify and select the best features. Fireflies are arbitrarily placed on various focal points of the image under consideration. The advantage of this approach is its fast convergence in selecting the best features. The gamma parameter (γ) controls the movement of fireflies in this feature space and can be tuned for gaining an improvement in the performance of the Face Recognition model. This chapter aims to evaluate the performance and viability of using the Firefly Algorithm for Face Recognition.
View chapter

Cuckoo Search (CS) is a nature inspired and meta-heuristic algorithm which is based on a brood reproductive strategy of cuckoo birds to increase their population. This algorithm mainly serves to determine the maximum or minimum value of a particular problem which is known as the objective function. CS has reportedly outperformed other nature inspired algorithms in terms of computational efficiency and the speed of convergence to reach an optimal solution. This chapter aims at exploring the application of CS to determine the parameters of Artificial Neural Networks (ANN). The inherent problem with traditional training of ANNs using backpropagation is that the learning process cannot guarantee a global minimum solution and has a tendency of getting trapped in a local minima. The working of such ANN models are restricted to a differentiable neuron transfer function. The CS algorithm has been observed to provide a solution without the use of derivates to optimize such convoluted problems. The usage of ANNs across a wide range of problems including classification tasks, image processing, signal processing, etc. justifies the application of CS to the backpropagation procedure of ANNs to achieve a faster rate of convergence and avoid the local minima problem. This chapter also presents discussions and results on how ANNs optimized with variants of CS perform when applied to detection of chronic kidney disease, modeling of operating photovoltaic module temperature and forest type classification.
View chapter

While deep learning systems have provided breakthroughs in several tasks in the medical domain, they are still limited by a problem faced by most deep learning applications. That problem is the dependency on the availability of training data. To counter this limitation, there is active research ongoing in few shot learning. Few shot learning algorithms aim to overcome the data dependency by exploiting the information available from a very small amount of data. The success of these algorithms can prove to be a significant advancement in the application of deep learning systems. In medical imaging, there is often a limitation on the available data, due to the rare occurrence of some diseases. As a result, there are several potential use cases for few shot learning algorithms in medical imaging. In this chapter, the background and working of few shot learning algorithms is explained. Then there is a study of the problems faced in medical imaging related to availability of limited data. After establishing context, the recent advances the in application of few shot learning to medical imaging tasks such as classification and segmentation are explored. The results of these applications are examined with a discussion on its future scope.
View chapter

Experience

Work Experience

Center for Data Science in Emergency Medicine, JHU

Data Science Intern Jun 2021 - Present

Performed statistical analysis on medical data to derive predictive indicators for acute kidney injury. Ingested raw data and created processed tables on Microsoft SQL server.

Bioengineering Innovation & Design, JHU

Machine Learning Research Intern Feb 2021 - May 2021

Implemented regression models to predict patients' blood pressure given non-invasive parameters tracked in real-time to prevent hypotension during dialysis. Models used were linear regression, RNN and LSTM.

DataBrain

Deep Learning Research Intern Jan 2020 - Dec 2020

Performed causal inference, clustering and prediction in customer text data to find factors leading to customer churn. Increased performance by 150%, through migration of codebase from Python to PySpark on AWS cluster. Mined correlation in event sequences and generated new features through statistical analysis of data.

Forus Health

Deep Learning Research Intern Aug 2019 - Feb 2020

Writing and implementing deep learning models for early detection of systemic diseases using biomarkers. Researching relevant work in the field in order to identify biomarkers that generate reliable accuracy. Application of suitable image processing algorithms on the medical images for feature extraction

CSE Department, Indian Institute of Technology Bombay

Machine Learning Intern Jun 2019 - Aug 2019

Writing and testing python implementations of machine learning and deep learning assignment models. Creation of modules to evaluate the performance of the output of these models, constrained by test cases. Configuration of the modules to host them on a web server, in order to enable automatic model evaluation.

Jatana

Deep Learning Research Intern Oct 2018 - May 2019

Performed training and deployment of deep learning models on the Google Cloud Platform using ML Engine. Research in increasing performance of Natural Language Processing models to be used in production. Experimenting the functionality of new deep learning models to extract useful applications on our data.

NISCIT Tech

Android App Developer Aug 2018 - Sep 2018

Developed the screens for a social media application and added interface for using AWS backend. Designed the user interaction flow for the application.

Symphony

Android App Developer Sep 2017 - Feb 2018

Frontend development for a social media application where I built the various application screens. UI/UX designing to assist with front end themes and created media content using Adobe AE.

Unicode

Mentor & Developer Jan 2018 - Jun 2019

During SE, built a college file sharing application as an Android developer. As a TE mentor, assisting junior developers in developing an attendance tracking application.

What do I do?

Here are some of my expertise

Machine Learning

Tensorflow, Keras, PyTorch, Google ML Engine, OpenCV

Android Development

Android Studio, SQLite, Firebase, Room

Web Development

Node.JS, MongoDB, Javascript, MySQL, Flask

Languages

Python, Java, C, Mark Up (HTML & XML)

Courses

Algorithms, Data Structures, Computer Networks

Others

Version Control, Blockchain, Heroku, Photoshop & AE

Get in Touch

Contact

jaikotia10@gmail.com

542, S.V.P. Road, Gandhi Building, 2nd Floor, Chowpatty, Mumbai - 400007

+91 9619909302 | +91 022 23611322