Pinned Repositories
AAAI_2023_Hierarchical-constrained-RL
"Constrained Reinforcement Learning in Hard Exploration Problems" Pathmanathan Pankayaraj, Pradeep Varakantham. AAAI Conference on Artificial Intelligence 2022
AdvBDGen
Code base for your work "AdvBDGen: Adversarially fortified prompt-specific fuzzy backdoor generator against llm alignment"
Cognitive_Computation-2023_Continual-Learning-With-Curiosity
"Using Curiosity for an Even Representation of Tasks in Continual Offline Reinforcement Learning" Pankayaraj Pathmanathan, Natalia Díaz-Rodríguez, Javier Del Ser. Cognitive Computation journal 2023
ICML_2024_RLHFPoisoning
"Is poisoning a real threat to LLM alignment? Maybe more so than you think" Pankayaraj Pathmanathan, Souradip Chakraborty, Xiangyu Liu, Yongyuan Liang, Furong Huang. ICML 2024 Workshop MHFAIA
LLM_Next_Word_Prediction
Code for next word prediction training based on the BookMIA dataset. This is part of the code for tests done of the work "Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?"
Multi-Arm-Bandit-Library
A python based library which includes multi_arm_bandit and Bayesian_optimization_algorithms. The PYPI repository can be found as mabandit 1.3
Programming_Algorithms
Small library of personal programming algorithm implementations
RLHF_Poisoning
sitnshop
An web application(+mobile application) to advertise any kinds of shops and more additional features
Temporal-Attention-Based-MARL
pankayaraj's Repositories
pankayaraj/Cognitive_Computation-2023_Continual-Learning-With-Curiosity
"Using Curiosity for an Even Representation of Tasks in Continual Offline Reinforcement Learning" Pankayaraj Pathmanathan, Natalia Díaz-Rodríguez, Javier Del Ser. Cognitive Computation journal 2023
pankayaraj/Temporal-Attention-Based-MARL
pankayaraj/ICML_2024_RLHFPoisoning
"Is poisoning a real threat to LLM alignment? Maybe more so than you think" Pankayaraj Pathmanathan, Souradip Chakraborty, Xiangyu Liu, Yongyuan Liang, Furong Huang. ICML 2024 Workshop MHFAIA
pankayaraj/sitnshop
An web application(+mobile application) to advertise any kinds of shops and more additional features
pankayaraj/AAAI_2023_Hierarchical-constrained-RL
"Constrained Reinforcement Learning in Hard Exploration Problems" Pathmanathan Pankayaraj, Pradeep Varakantham. AAAI Conference on Artificial Intelligence 2022
pankayaraj/Multi-Arm-Bandit-Library
A python based library which includes multi_arm_bandit and Bayesian_optimization_algorithms. The PYPI repository can be found as mabandit 1.3
pankayaraj/RLHF_Poisoning
pankayaraj/AdvBDGen
Code base for your work "AdvBDGen: Adversarially fortified prompt-specific fuzzy backdoor generator against llm alignment"
pankayaraj/cepdnaclk.github.io
Github pages website for Department of Computer Engineering, University of Peradeniya
pankayaraj/CMSC742
pankayaraj/deepwalk
DeepWalk - Deep Learning for Graphs
pankayaraj/Diverse_RL
pankayaraj/LLM_Next_Word_Prediction
Code for next word prediction training based on the BookMIA dataset. This is part of the code for tests done of the work "Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?"
pankayaraj/Django_Server_Sleep_Apnea
Central server made on django and rest framework to facilitate the detection of sleep apnea problem
pankayaraj/ECC_20_MAMAB
"A Decentralized Communication Policy for Multi Agent Multi Armed Bandit Problems" P Pankayaraj, DHS Maithripala
pankayaraj/fall2022
TA course for Fall 2022
pankayaraj/google-research
Google Research
pankayaraj/Machine_Learning_Algorithms_NUMPY
Implementation of Machine Algorithms including NN in numpy
pankayaraj/Models_2024
pankayaraj/NeuralNetworkProject
Application of various neural networks on MNIST data set(On going)
pankayaraj/pankayaraj.github.io
Personal We Page
pankayaraj/PoisonedRAG
[USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models
pankayaraj/Reinforcement_Learning
Reinforcement learning algorithms taught by David Silver on youtube to small scale problems
pankayaraj/RL_Pretraining
pankayaraj/sac
Soft Actor-Critic
pankayaraj/Sleep_Apnea_Detection-1
Non intrusive method for detecting sleep apnea in infants.
pankayaraj/soft-actor-critic
Implementation of the paper Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
pankayaraj/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
pankayaraj/trojan-detection
pankayaraj/website