top of page

SAGNIK GHOSAL

SDE @ AWS CloudWatch Logs | Ex-Senior Software Engineer @ Capgemini | MS Data Science | B.E. Electrical Engineering

WhatsApp Image 2023-03-23 at 9.51.46 PM.jpeg

About Me

I am presently working as an SDE at AWS CloudWatch Logs, based in Seattle, USA. I hold a Master's degree in Data Science from the University of Washington, Seattle, and a Bachelor's degree in Electrical Engineering from Jadavpur University, India.

Previously, I worked as an SDE intern at Amazon and as a Senior Software Engineer at Capgemini. I am a published author with 100+ citations and a reviewer across multiple international journals and conferences. Additionally, I was selected as a research intern at Microsoft Research India and as one of the 200 Young Researchers globally to attend the 10th Heidelberg Laureate Forum 2023, held in Heidelberg, Germany.

My technical interests and proficiencies span various fields, including software development, data science, computer vision, machine and deep learning. Here are some of my skills:

  • Data Analytics and BI: RDBMS, Statistical Modeling, Machine Learning, Deep Learning, Power BI, MS Excel

  • Languages: Python, Java, R, C/C++, SQL, MATLAB, LaTeX

  • Developer Tools: Git and Version Control, PyCharm, Jupyter Notebook, Google Colab, Flask, HTML, CSS, PySpark

  • Libraries and Packages: Pandas, NumPy, TensorFlow, Keras, Scikit-learn, Matplotlib, Seaborn, NLTK, OpenCV

  • Software Development: DSA, OOP, Agile Methodologies

  • Cloud Computing: Amazon Web Services, Google Cloud Platform, Microsoft Azure                               

​

EDUCATION

MS in Data Science, University of Washington, Seattle, USA, September 2022 - March 2024, GPA: 3.93/4.0

B.E. in Electrical Engineering, Jadavpur University, Kolkata, India, July 2017 - June 2021, GPA: 9.1/10

INDUSTRY EXPERIENCE

Amazon Web Services, Seattle USA

​Software Development Engineer Intern                                        June 2023 - September 2023

  • Streamlined load testing with a One-touch mechanism, simplifying inputs, enabling flexible configurations, and automating test validity checks and metrics.

  • Implemented changes in frontend and backend to send client-side 400 errors to deny listed customers with a custom error message asking to reach out to support.

Capgemini, Hyderabad, India

Senior Software Engineer                                                     June 2021 - August 2022

  • Translated SAS code to Pandas/PySpark, reducing computation time from hours to minutes.

  • Created PowerBI dashboards for clear data visualization.

  • Collaborated with clients to resolve software-related technical problems and tackle new challenges.

RESEARCH EXPERIENCE

University of Washington, Seattle, USA

Graduate Research Assistant                                                  January 2023 - March 2024​

  • Developed a DNA design algorithm considering DNA polymerase bonding, sequences, and melt temperature to improve genetic material comprehension and manipulation.

  • Developed an algorithm to match specific assay/workflow requirements and collect real-time fluorescence data.​

  • Developed a DeDx framework in Python and R, using a two-step scikit-learn pipeline for improved diagnostic tool accuracy. Evaluated the system's performance to maximize predictive values and population coverage.​

Indian Statistical Institute, Kolkata, India

​Research Assistant                                                          June 2019 - September 2021

  • Developed an unsupervised streaming anomaly detection framework with dynamic graphs and clustering, surpassing SOTA models on 31 real and 22 synthetic datasets in accuracy, precision, recall, F-score, and MCC.

  • Framework parameters include overestimated k for KNN and model retraining frequency, presented in a research paper currently under review.

  • Achieved 90% accuracy in a face recognition app using PCA and the ORL face dataset, and also wrote comprehensive reports on Multimodal Machine Learning and Subspace Learning.

Jadavpur University, Kolkata, India

Research Assistant                                                           March 2021 - October 2021

  • Developed a fuzzy ensemble deep neural model to identify human activities using sensor multimodal data.

  • The proposed technique improved SOTA accuracy by 5% and reduced computation time by 10%.

  • Achieved better performance than SOTA models on four real-life human activity recognition datasets.

  • Published the research findings in the IEEE Internet of Things Journal.

IIIT Naya Raipur, India

Research Assistant                                                              May 2020 - August 2021

  • Led a team of 3 to develop an autonomous deep neural, non-invasive framework to measure and monitor three essential vitals: Heart Rate (HR), Blood Pressure (BP), and Oxygen Saturation (Sp02). The model achieved R-squared correlation coefficients of 0.9707, 0.9886, 0.9714, and 0.994 for HR, systolic BP, diastolic BP, and SpO2, respectively. Published the findings in Elsevier Internet of Things Journal.

  • Led a team of 4 to develop a smart blood glucose and diabetes sensing model which achieved 10.7% MAE, 1.12 mg/dL bias, 91.89% accuracy, 94% sensitivity, and 82% specificity. Published the findings in IEEE Sensors Journal.

  • Developed blood hemoglobin level and anemia-polycythemia screening model with ±0.33 g/dL accuracy, 0.2 g/dL bias, and 90% sensitivity, with adaptive K selection in KNN and automatic brightness adjustment. Published the findings in ACM Transactions on Computing for Healthcare.

  • Developed a smartphone-based model for measuring blood hemoglobin levels with an accuracy of ±0.32 g/dL and screening for anemia with 89% accuracy, and published the findings in IEEE Sensors Journal.

PROJECTS

  • Fraudulent Medical Claims Detection                                                   [Github Link]

    • ​Developed an infrastructure to detect false claims using data visualization and modeling. Used PyOD library to implement supervised and unsupervised models and Flask to develop the web application.                                        

​

  • Anomaly Detection in Streaming Environment                                            [Github Link]

    • ​Developed a dynamic tree clustering-based anomaly detection model for streaming environment. The proposed model is fast, memory efficient, highly accurate, and requires minimal user input. Tested on 31 real-life and 22 synthetic datasets, it surpasses the performance of existing state-of-the-art streaming anomaly detection frameworks.   

​

  • Human Activity Recognition                                                            [Github Link]

    • Developed a fuzzy ensemble-based approach using deep neural networks for Human Activity Recognition using sensor multimodal data. Proposed a novel rewarding and penalization logic that improved the existing state-of-the-art accuracy by 5% while reducing the computation time by 10%, tested on 4 publicly available real-life HAR datasets.        

​

  • sHEMO: Smartphone Spectroscopy for Blood Hemoglobin Level Monitoring                 [Github Link]

    • ​​Developed an algorithm in Python that predicts blood hemoglobin with an accuracy of 97% and anemia screening with an accuracy of 89%, providing an economical option for mass anemia screening. Published the findings in IEEE Sensors journal.                                        

​

  • gluCam: Smartphone Based Blood Glucose Monitoring and Diabetic Sensing               [Github Link]

    • ​Led a 4 member team to develop a smartphone-based blood glucose measurement and diabetic sensing model that reported an accuracy of 91.89% and sensitivity of 94% in diabetic sensing tested on 81 patients.                                                                                          

  • Chatbot                                                                                [Github Link]

    • Developed a chat application using Python and Google Cloud Console by performing data pre-processing, vectorizations, and implementing an encoder-decoder-based LSTM neural network using PyTorch. The model reported an overall accuracy of 93.5% on the test dataset. It reduced customer waiting time and facilitated easier ordering, increasing sales in the restaurant where it was deployed.                                                                                           

  • Non-Invasive Anemia-Polycythemia Detection in the IoMT                               [Github Link]​

    • ​Developed an algorithm in Python that estimates blood hemoglobin using both conjunctiva and fingernails as ROI by leveraging the feature of the IoMT. The model reported an accuracy of 98.4% in hemoglobin prediction, tested over 100 patients.

​ 

  • Object Detection and Distance Estimation                                              [Github Link]

    • ​Developed an algorithm in MATLAB that estimates the distance of the object(s), from a two-camera setup using the triangularization technique. The model reported an average accuracy of 95% over 50 images, with the highest being 99.8%.   

​

  • Depth Map From Stereoscopic Images                                                   [Github Link]

    • ​Successfully implemented state-of-the-art depth estimation technique using MATLAB programming language. The model estimates disparity and re-constructs the point cloud using stereo images.

​​

  • Comparison of Filtering Techniques                                                   [Github Link]

    • ​Developed a comparison algorithm in Python that demonstrated the efficacy of a Gaussian filter in the case of Gaussian noise and a Median filter for salt and pepper noise.                                                                                          

PUBLICATIONS

JOURNALS:

  • DeepVitals: Deep neural and IoT based vitals monitoring in smart teleconsultation system, â€‹Sagnik Ghosal, Debanjan Das, Venkanna Udutalapally, Srivatsan Sridhar, Syed Maaiz Syed Shabbeer Basha, Preetam Narayan Wasnik, Elsevier Internet of Things Journal, Volume 25, pp. 101117, April, 2024, DOI: 10.1016/j.iot.2024.101117                                                                               

  • NoFED-Net: Non-Linear Fuzzy Ensemble of Deep Neural Networks for Human Activity Recognition, â€‹Sagnik Ghosal, Mainak Sarkar, Ram Sarkar, IEEE Internet of Things Journal, Volume 9, Issue 18, pp. 17526-17535, 15 Sept, 2022, DOI: 10.1109/JIOT.2022.3155560                                                         

  • CoviLearn: A Machine Learning Integrated Smart X-Ray Device in Healthcare Cyber-Physical System for Automatic Initial Screening of COVID-19, Debanjan Das, Sagnik Ghosal, Saraju P. Mohanty, SN Computer Science, Volume 3, Issue 150, pp. 24869-24878, 1 Feb, 2022, DOI: 10.1007/s42979-022-01035-x                                                                                             

  • iNAP: A Hybrid Approach for NonInvasive Anemia-Polycythemia Detection in the IoMT, Sagnik Ghosal, Debanjan Das, Venkanna Udutalapally, Preetam Narayan Wasnik, ACM Transactions on Computing for Healthcare, Volume 3, Issue 3, July 2022, DOI: 10.1145/3503466                                           

  • gluCam: Smartphone Based Blood Glucose Monitoring and Diabetic Sensing, Sagnik Ghosal, Abhishek Kumar, Venkanna Udutalapally, Debanjan Das, IEEE Sensors Journal, Volume 21, Issue 21, pp. 24869-24878, 1 Nov.1, 2021, DOI: 10.1109/JSEN.2021.3116191                                                           

  • sHEMO: Smartphone Spectroscopy for Blood Hemoglobin Level Monitoring in Smart Anemia-Care, Sagnik Ghosal, Debanjan Das, Venkanna Udutalapally, Asoke K Talukder, Sudip Misra, IEEE Sensors Journal, Volume 21, Issue 6, pp. 8520-8529, 15 March.15, 2021, DOI: 10.1109/JSEN.2020.3044386                     

​

CONFERENCES:

  • Addressing Class Imbalance in Semi-supervised Image Segmentation: A Study on Cardiac MRI, Hritam Basak, Sagnik Ghosal, Ram Sarkar, International Conference on Medical Image Computing and Computer Assisted Intervention – MICCAI 2022. MICCAI 2022. Lecture Notes in Computer Science, vol 13438. Springer, Cham, DOI: 10.1007/978-3-031-16452-1_22                                                     

  • Monocular Depth Estimation Using Encoder-Decoder Architecture and Transfer Learning from Single RGB Image, Hritam Basak, Sagnik Ghosal, Mainak Sarkar, Mayukhmali Das, Soham Chattopadhyay, 2020 IEEE 7th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON), 2020, pp. 1-6, DOI: 10.1109/UPCON50219.2020.9376365                     

​

TECHNICAL REPORTS:

  • Multimodal Machine Learning, Theoretical and Empirical Machine Learning, Indian Statistical Institue, November 11, 2019, Link to Paper                                                            

  • Subspace Machine Learning, Theoretical and Empirical Machine Learning, Indian Statistical Institute, November 11, 2019, Link to Paper                                                                     

ACHIEVEMENTS

  • Started working as a full-time SDE at AWS CloudWatch Logs, based in Seattle, USA.

  • Graduated with a Master's in Data Science from UW Seattle, and ranked among the top 5% of the class.

  • Selected as one of the 200 Young Researchers worldwide to attend the 10th Heidelberg Laureate Forum.

  • Selected as an SDE intern at Amazon Web Services, Seattle, for Summer 2023.

  • Started working as a Graduate Research Assistant in Lutz Lab at UW Seattle.

  • Selected as a Research Intern at Microsoft Research India to work on low-cost diagnostics.

  • Received admission offers from UW Seattle, USC, and Northwestern University for MS in Data Science/Analytics.

  • Graduated with a Bachelor's in Electrical Engineering from Jadavpur University, and ranked among the top 10% of the class.

  • Ranked 70 out of 2081 students from Jadavpur University, Kolkata on GeeksforGeeks.

  • Secured 542 Rank in West Bengal JEE and was among the top 0.36% of about 1.5 lakh candidates.

  • Qualified the prestigious JEE Mains examination and was among the top 3% of about 1 million candidates.

  • Secured an International rank of 96 in the International Olympiad of Mathematics, organized by SilverZone in 2016.

ACHIEVEMENTS

GALLERY

bottom of page