We are going to show you a profile for a few seconds.
When we are done we will ask you a few questions about the resume.
Don't worry about remembering everything, we just want you to gauge your first impression of this job seeker.
The timer on the right will count down for you.
Click here when you are ready:
Muhammad Ali Valliani
(XXX) XXX-XXXX | XXXX@XXXX.XXX | Union City, CA 94587 | https://github.com/mavalliani/
Results-oriented Machine Learning professional with strong experience in Deep Learning. I have 3 years of experience, with
strong research skills and significant work in quantitative and language models.
Data Science and Machine Learning tasks using Regression, Classifications, Trees, Support Vector Machines, Kernels.
Deep Learning. Convolutional Nets (CNN). Recurrent Nets (RNN). LSTM. Transfer Learning. Adversarial Nets (GAN). etc.
Sequence Models. (Natural Language Processing. Word Embeddings. Transformers. – attention based)
Unsupervised Learning. (Mixture Models. LDA. Clustering.)
Research (quick to understand/implement work from advanced publications, published scholarly blog posts)
Hyper-parameter tuning. Bagging. Boosting. Regularization.
Mathematical Optimization. (convex optimization. gradient descent. Newton’s Method.)
Algorithms. Extensive programming knowledge.
Experienced in working with GPUs, CUDA, AWS.
Statistical Analysis. (Time Series. Bayesian methods. Stochastic processes.); Data Science. (Machine
Learning. Deep Learning. Natural Language Processing.)
Python. R. Spark. SQL.
Data Science toolkit:
ML (Scikitlearn.), DL. (Tensorflow. Keras. Pytorch), NLP. (NLTK. Gensim. Spacy. Embeddings.)
Mathematical Optimization. Ensemble models. applied research.
California State University East Bay, Hayward, CA
08/2018 – 05/2020
GIK Institute of Engineering Sciences and Technology
Machine Learning Engineer
Marketchal Private Limited
06/2015 - 03/2018
Set up machine learning pipelines, Rest APIs and data architectures to processing million+ queries a day.
Predictive modeling of user behavior. Models used: Boosting, Bagging, Feature Engineering, and Clustering (unsupervised).
Optimized campaigns content and bidding using Time series, Quantitative and Sequence Models (NLP). Ensemble Methods.
Shell scripting on Linux including server deployment and automation scripts.
Tools and frameworks used are scikitlearn, Tensorflow, Pytorch, MySQL, PostgreSQL and deployment on AWS.
Machine Learning Fellow
12/2019 – Present
Crawled food images and modeled a Raw Food classifier for prototyping a smart oven. Tensorflow - CNN using Resnet50.
Detection of out-of-distribution data using a model-agnostic gateway module. Used Pytorch and FastAI.
Developed and deployed language and quantitative models on AWS and Amazon Sagemaker. Rest APIs.
Data Science Intern
Branch Metrics Inc (Redwood City, CA)
05/2019 – 8/2019
Worked on a multi-vertical, general purpose entity search using Elastic Search as entity data store. Indexing and Ranking.
Research and develop NLP/NLU component of entity extraction, semantic search and sentiment analysis with statistical
learning for mobile apps with high performance. Frameworks used: NLTK, SpaCy, Gensim.
Provide artificial intelligence, deep learning, machine learning and NLP solutions for knowledge graph.
Generate over 2 million labeled relations for location data using Common Crawl data Wikipedia data.
Developed and optimized scoring algorithms for queries to match with relevant verticals.
Statistical Language models for query understanding (NLP methods), Ranking functions (Extended Vector Space Models).
Semantic Similarity of Sentences. Methods used: Cosine Similarity with Glove, Smooth Inverse Frequency, Word
Movers Difference, Sentence Embedding Models (Infersent and Google Sentence Encoder), ESIM with pre-trained
FastText embedding. Best performing method on Quora Question pair dataset was an Ensemble method with 0.27
log-loss. https://github.com/mavalliani/Semantic-Similarity-of-Sentences [October 2019]
Feature Engineering and Analysis of News data to Predict Stock Price Movements. Model: XGBoost optimized up
to an RMSE of 0.7. Link: https://github.com/mavalliani/News-data-for-stock-prediction [October 2018]
Classification (kNN model) of human activity based on data from Inertial Measurement Units (with Dr. Bradford
Bennett – CSU East Bay). Prediction accuracy of 98% was achieved. Link:
Playing Poker deterministically. Algorithmically unwrapping scenarios where it is ‘risk-free’ to bet.
I maintain a Video Book Publishing site: Jamnosh (https://jamnosh.com) for my love of reading.
Working on generating fiction stories (text data) using advanced NLP methods (Long term project).
Honors & Awards
STEM Scholar - Institute of STEM Education CSUEB (Distinguished Student in Statistics)
Awarded a grant of 20 million Chilean Pesos by CORFO Chile for TOKEN (Startup Chile Portfolio Company).
Dean Honor Student at GIK Institute.
International Mathematics Olympiad (IMO 2009) – Pakistan camp participant.
Teaching Associate – Statistics (CSUEB): Taught Probability, Gaussian and related distributions, Statistical
tests and Regression to a class of 100+ students.