pankayaraj

Pinned Repositories

AAAI_2023_Hierarchical-constrained-RL
"Constrained Reinforcement Learning in Hard Exploration Problems" Pathmanathan Pankayaraj, Pradeep Varakantham. AAAI Conference on Artificial Intelligence 2023
Language:Python1 2 00
AdvBDGen
Code base for our work "AdvBDGen: Adversarially fortified prompt-specific fuzzy backdoor generator against LLM alignment"
Language:Python0 1 00
Cognitive_Computation-2023_Continual-Learning-With-Curiosity
"Using Curiosity for an Even Representation of Tasks in Continual Offline Reinforcement Learning" Pankayaraj Pathmanathan, Natalia Díaz-Rodríguez, Javier Del Ser. Cognitive Computation journal 2023
Language:Python8 3 10
ICML_2024_RLHFPoisoning
"Is poisoning a real threat to LLM alignment? Maybe more so than you think" Pankayaraj Pathmanathan, Souradip Chakraborty, Xiangyu Liu, Yongyuan Liang, Furong Huang. ICML 2024 Workshop MHFAIA
Language:Python2 2 01
LLM_Next_Word_Prediction
Code for next-word prediction training on the BookMIA dataset. Part of the experimental code for the paper "Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?"
Language:Python0 2 00
Multi-Arm-Bandit-Library
A Python-based library that includes multi_arm_bandit and Bayesian_optimization_algorithms. Available on PyPI as mabandit 1.3.
Language:Python1 1 02
Programming_Algorithms
Small library of personal programming algorithm implementations
Language:Python0 1 00
RLHF_Poisoning
Language:Python10
sitnshop
A web application (plus a mobile application) for advertising any kind of shop, with additional features
Language:Python21
Temporal-Attention-Based-MARL
Language:Python4 3 03

pankayaraj's Repositories

pankayaraj/Cognitive_Computation-2023_Continual-Learning-With-Curiosity
"Using Curiosity for an Even Representation of Tasks in Continual Offline Reinforcement Learning" Pankayaraj Pathmanathan, Natalia Díaz-Rodríguez, Javier Del Ser. Cognitive Computation journal 2023
Language:Python8 3 10
pankayaraj/Temporal-Attention-Based-MARL
Language:Python4 3 03
pankayaraj/ICML_2024_RLHFPoisoning
"Is poisoning a real threat to LLM alignment? Maybe more so than you think" Pankayaraj Pathmanathan, Souradip Chakraborty, Xiangyu Liu, Yongyuan Liang, Furong Huang. ICML 2024 Workshop MHFAIA
Language:Python2 2 01
pankayaraj/sitnshop
A web application (plus a mobile application) for advertising any kind of shop, with additional features
Language:Python21
pankayaraj/AAAI_2023_Hierarchical-constrained-RL
"Constrained Reinforcement Learning in Hard Exploration Problems" Pathmanathan Pankayaraj, Pradeep Varakantham. AAAI Conference on Artificial Intelligence 2023
Language:Python1 2 00
pankayaraj/Multi-Arm-Bandit-Library
A Python-based library that includes multi_arm_bandit and Bayesian_optimization_algorithms. Available on PyPI as mabandit 1.3.
Language:Python1 1 02
pankayaraj/RLHF_Poisoning
Language:Python10
pankayaraj/AdvBDGen
Code base for our work "AdvBDGen: Adversarially fortified prompt-specific fuzzy backdoor generator against LLM alignment"
Language:Python0 1 00
pankayaraj/cepdnaclk.github.io
GitHub Pages website for the Department of Computer Engineering, University of Peradeniya
Language:HTML0 1 00
pankayaraj/CMSC742
Language:Python00
pankayaraj/deepwalk
DeepWalk - Deep Learning for Graphs
Language:Python00
pankayaraj/Diverse_RL
Language:Python0 1 00
pankayaraj/LLM_Next_Word_Prediction
Code for next-word prediction training on the BookMIA dataset. Part of the experimental code for the paper "Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?"
Language:Python0 2 00
pankayaraj/Django_Server_Sleep_Apnea
Central server built with Django and REST framework to facilitate the detection of sleep apnea
Language:HTML1 0
pankayaraj/ECC_20_MAMAB
"A Decentralized Communication Policy for Multi Agent Multi Armed Bandit Problems" P Pankayaraj, DHS Maithripala
Language:Python3 0
pankayaraj/fall2022
TA course for Fall 2022
Language:Jupyter Notebook0 0
pankayaraj/google-research
Google Research
pankayaraj/Machine_Learning_Algorithms_NUMPY
Implementations of machine learning algorithms, including neural networks, in NumPy
Language:Python1 0
pankayaraj/Models_2024
pankayaraj/NeuralNetworkProject
Application of various neural networks to the MNIST dataset (ongoing)
Language:Python1 0
pankayaraj/pankayaraj.github.io
Personal Web Page
Language:HTML
pankayaraj/PoisonedRAG
[USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models
Language:Python0 0
pankayaraj/Reinforcement_Learning
Reinforcement learning algorithms from David Silver's YouTube lectures, applied to small-scale problems
Language:Python
pankayaraj/RL_Pretraining
Language:Python
pankayaraj/sac
Soft Actor-Critic
Language:Python
pankayaraj/Sleep_Apnea_Detection-1
Non-intrusive method for detecting sleep apnea in infants.
Language:HTML1 0
pankayaraj/soft-actor-critic
Implementation of the paper Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Language:Python
pankayaraj/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
Language:Python2 0
pankayaraj/trojan-detection
Language:Jupyter Notebook
pankayaraj/website