ssfinetuning package

Submodules

ssfinetuning.dataset_utils module

class ssfinetuning.dataset_utils.SimpleDataset(dataset)

Bases: Generic[torch.utils.data.dataset.T_co]

A simple dataset for utilities in Co-Training and Tri-Training.

Args:

dataset (Union[SimpleDataset, dataset]): Can be SimpleDataset object or pyarrow based dataset object.

Class attributes:

-original_len: Length of the dataset at the instantiation.

-to_append_dic: The dictionary used in appending unlabeled examples in dataset.

-batch_masks: This dictionary keeps track of the unlabeled examples which are removed and inserted in the dataset during appending procedure.

append(ul_data, mask=None, batch_index=None)

Function used during the appending procedure.

Args:

ul_data (torch.FloatTensor): Unlabeled data batch.

mask (torch.BoolTensor): Mask of the data object which are going to accepted from the batch. This object also helps in keeping track of the examples which are inserted.

batch_index (:obj: ´int´): Index of the batch of unlabeled data. To be used by batch_masks dictionary.

Return: mask_change.sum() (:obj: ´int´): Sum of any insertion and deletion of examples in the dataset.

extend_length(length): Extends the length of the dataset by randomly repeating length amount of rows.

reformat(): After appending using usual list appending, dataset is reformat to huggingface dataset format.

reset(): Resets the dataset to the original length at the instantiation.

ssfinetuning.dataset_utils.dic_to_pandas(history, loss_key='eval_loss', accuracy_measure='eval_matthews_correlation')

Function to convert the list of dictionary to pandas DataFrame which is easier for the plotting function in plotting utils to handle

Args:

history (list): A list of history dictionaries. It is basically transformer.TrainerState() at different hyperparameters analysed.

loss_key (str): The key to look for. In the case of analysis of evaluation history, the key is ‘eval_loss’. In the case of analysis of training history the key is ‘train_loss’

accuracy_measure (str): Name of the metric used during evaluation.

ssfinetuning.dataset_utils.extract_keys(function, kwargs, remove_from_orignal=True): Function which extract the keys of “function” from “kwargs” by inspecting the signature of “function”.

ssfinetuning.dataset_utils.match_with_batchsize(lim, batchsize): Function used by modify_datasets below to match return the integer closest to lim which is multiple of batchsize, i.e., lim%batchsize=0.

ssfinetuning.dataset_utils.modify_datasets(dataset, labeled_fr=0.5, model_type='TriTrain', labeled1_frac=0.33, train_key='train', label_column='label', unlabeled_labels=- 1, batchsize=16)

Function to modify pyarrow based datasets (huggingface dataset) for testing at different fraction of labeled vs unlabeled data.

Args:

dataset (dataset.DatasetDict): Dictionary containing training and validation datasets.

labeled_frac (float): Fraction of training dataset to be kept as labeled dataset and rest will be divided as unlabeled dataset.

model_type (str): Semi supervised model type.

labeled1_frac (float): In the case of CoTraining and TriTraining model_type, this is the fraction given to the first two models (m1 and m2) after being divided by labeled_fr. Rest is given to model 3. For example, labeled1_frac=0.33, m1 and m2 gets 0.33 and m3 gets (1-2*0.33)

train_key (str): Key value of where training data is accessed.

label_column (str): Key value of where columns for labels.

unlabeled_labels (int): Value to be assigned to the unlabeled dataset labels, required for Pi, TemporalEnsemble, and MeanTeacher as they need to know which ones are unlabeled examples.

batchsize (int): Batch size used during training.

Return: dataset.DatasetDict object with labeled and unlabeled data.

ssfinetuning.default_args module

class ssfinetuning.default_args.DefaultArgs

Bases: object

get_default_ta(logging_dir=''): Return the TrainingArguments with the logging_dir setup for semisupervised model.

get_default_ta_sup(logging_dir=''): Return the TrainingArguments with the logging_dir setup for supervised model.

set_default_args(dataset, model_name, kwargs): Function for setting the default arguments if these keywords are not provided in kwargs of train_with_ssl. Updates the kwargs to be used transformers.Trainer. Args: dataset (DatasetDict ): Dataset dictionary containing labeled and unlabelled data. model_name (str or os.PathLike): “pretrained_model_name_or_path” in ~transformers.PreTrainedModel, please refer to its documentation for further information. (i) In this case of a string, the model id of a pretrained model hosted inside a model repo on huggingface.co. (ii) It could also be address of saved pretrained model. kwargs (dict): keyword arguments to be used by transformers.Trainer.

ssfinetuning.default_args.encode(dataset, model_name='albert-base-v2', text_column_name='sentence')

Function for encoding the dataset using tokenizer

Args:

dataset (DatasetDict ): Dataset dictionary containing labeled and unlabeled data.

model_name (str or os.PathLike): “pretrained_model_name_or_path” in ~transformers.PreTrainedModel, please refer to its documentation for further information.

(i) In this case of a string, the model id of a pretrained model hosted inside a model repo on huggingface.co.

It could also be address of saved pretrained model.

text_column_name (str): column name for where the text is.

Return:

encoded_dataset (DatasetDict ): containing columns now which are required by the forward function.

tokenizer (PreTokenizer ):

ssfinetuning.default_args.get_default_cm(): Default compute metric function.

ssfinetuning.models module

class ssfinetuning.models.BaseModelClass(model_name='albert-base-v2', supervised_run=False, num_labels=2, classifier_dropout=0.1, num_models=1, ssl_model_type=None)

Bases: torch.nn.modules.module.Module

Base class for all model with single pretrained model, but might have multiple classifier layers.

Args:

model_name (str or os.PathLike): “pretrained_model_name_or_path” in ~transformers.PreTrainedModel, please refer to its documentation for further information.

(i) In this case of a string, the model id of a pretrained model hosted inside a model repo on huggingface.co.

It could also be address of saved pretrained model.

supervised_run (bool): If the model is taken from the supervised run or not. In that case transformer_model_name is the path to the saved model.

num_labels (int): number of labels to be classified.

classifier_dropout (float): dropout probability of the classifier layers.

num_models (int): number of models, i.e. number of classifier layers (only set by the sub classes).

ssl_model_type (str): semi supervised learning model type (only set by the sub classes).

simple_forward_with_prob_logits(classifier_num=0, **kwargs)

This function first changes the pointer of the pretrained_model to the one of the classifier defined in this class. Then applies softmax to it and thus converts it to probability logits.

Args:

classifier_num: Index of the classifier to be used.

kwargs: Arguments from pretrained_model.forward.

Return:

logits ( torch.FloatTensor): probability logits.

training: bool

class ssfinetuning.models.BaseMultiPretrained(teacher_student_name=('albert-base-v2', 'albert-base-v2'), num_labels=2, teacher_dropout=None, student_dropout=None, ssl_model_type=None)

Bases: torch.nn.modules.module.Module

Base class for all models with multiple pretrained model.

Args:

ssl_model_type (str): semi supervised learning model type.

teacher_student_name (Tuple[`str, str]): A Tuple for teacher and student name, respectively. “pretrained_model_name_or_path” in ~transformers.PreTrainedModel, please refer to its documentation for further information.

(i) In this case of a string, the model id of a pretrained model hosted inside a model repo on huggingface.co.

It could also be address of saved pretrained model.

num_labels (int): number of labels to be classified.

student_dropout (float): dropout probability of the student classifier layers.

teacher_dropout (float): dropout probability of the teacher classifier layers.

training: bool

class ssfinetuning.models.CoTrain(o_weight=0.01, num_labels=2, model_name='albert-base-v2', classifier_dropout=0.1, ssl_model_type='CoTrain', num_models=2, supervised_run=False)

Bases: ssfinetuning.models.BaseModelClass

Implementation of Co Training as introduced in <https://www.cs.cmu.edu/~avrim/Papers/cotrain.pdf>

Args:

o_weight (float): Orthogonality weight for the two classifiers (or two models).

kwargs: remaining dictionary of keyword arguments from the BaseModelClass.

cotrain_forward(model1_batch, model2_batch)

Forward function used during training of models. See ~trainer_utils.TrainerForCoTraining for more details.

Args: model1_batch (:obj: torch.FloatTensor) batch for model 1. model2_batch (:obj: torch.FloatTensor) batch for model 2.

Return: CoTrainModelOutput object with the information of logits of both models and the loss function.

forward(**kwargs)

Forward function only used during the evaluation of models. See ~trainer_utils.TrainerForCoTraining and ~transformers.Trainer for more details.

Args:

kwargs: Arguments from pretrained_model.forward.

Return: CoTrainModelOutput object with the information of logits of both models and the loss function.

training: bool

class ssfinetuning.models.CoTrainModelOutput(loss: Union[torch.FloatTensor, NoneType] = None, logits_m1: torch.FloatTensor = None, logits_m2: torch.FloatTensor = None)

Bases: transformers.file_utils.ModelOutput

logits_m1: torch.FloatTensor = None

logits_m2: torch.FloatTensor = None

loss: Optional[torch.FloatTensor] = None

class ssfinetuning.models.MeanTeacher(teacher_student_name=('albert-base-v2', 'albert-base-v2'), num_labels=2, unsup_weight=0, teacher_dropout=None, alpha=0.5, student_dropout=None)

Bases: ssfinetuning.models.BaseMultiPretrained

Implementation of Mean Teacher as introduced in <https://arxiv.org/abs/1703.01780>

Args:

alpha (float): memory of the last epochs.

unsup_weight: Initial unsupervised weight.

Class attributes:

-firstpass: bool variable to track if its the first pass through the forward method.

forward(**kwargs)

Implementation of forward function calculating the semi supervised loss. Mixing of the labeled and unlabeled examples in a single batch is not allowed.

Args:

kwargs: Arguments from pretrained_model.forward.

Return: transformers.modeling_outputs.SequenceClassifierOutput object with the information of logits and the loss function.

training: bool

update_teacher_variables(): Function for updating teacher weights and bias. Directly used from <https://github.com/CuriousAI/mean-teacher>

zero_teacher_weights(module): Function for zeroing the teachers weights and biases.

class ssfinetuning.models.NoisyStudent(teacher_dropout=None, student_dropout=None, num_labels=2, teacher_student_name=('albert-base-v2', 'albert-base-v2'))

Bases: ssfinetuning.models.BaseMultiPretrained

Implementation of Noisy Student as introduced in <https://arxiv.org/abs/1911.04252>

Args:

kwargs: keyword arguments are the same as is for BaseMultiPretrained class, except model type string. Class forward initialized with teacher as the teacher is trained first.

training: bool

class ssfinetuning.models.PiModel(unsup_weight=0, num_labels=2, model_name='albert-base-v2', classifier_dropout=0.1, supervised_run=False)

Bases: ssfinetuning.models.BaseModelClass

Implementation of pi model from <https://arxiv.org/abs/1610.02242>.

Args:

unsup_weight (float): Initial value of the weight of the unsupervised loss component. Its value is controlled by unsupervised weight scheduler.

kwargs: remaining dictionary of keyword arguments from the BaseModelClass.

forward(**kwargs)

Implementation of forward function calculating the semi supervised loss. Mixing of the labeled and unlabeled examples in a single batch is not allowed.

Args:

kwargs: Arguments from pretrained_model.forward.

Return: transformers.modeling_outputs.SequenceClassifierOutput object with the information of logits and loss function.

training: bool

class ssfinetuning.models.TemporalEnsembleModel(unsup_weight=0, num_labels=2, model_name='albert-base-v2', alpha=0.5, classifier_dropout=0.1, supervised_run=False)

Bases: ssfinetuning.models.BaseModelClass

Implementation of Temporal ensemble model as introduced in <https://arxiv.org/abs/1610.02242>

Args:

alpha (float): memory of the last epochs. For more info please refer to <https://arxiv.org/abs/1610.02242>.

unsup_weight (float): initial value of weight of the unsupervised loss component. After setting the initial value, its value is controlled by unsupervised weight scheduler.

kwargs: remaining dictionary of keyword arguments from the BaseModelClass.

Class attributes:

-mini_batch_num: keeps track of the mini_batch_num using forward method.

-logits_batchwise: stores the logits of each batch passed through forward method.

-firstpass: bool variable to track if its the first pass through the forward method.

forward(**kwargs)

Implementation of forward function calculating the semi supervised loss. Mixing of the labeled and unlabeled examples in a single batch is not allowed.

Args:

kwargs: Arguments from pretrained_model.forward.

Return: transformers.modeling_outputs.SequenceClassifierOutput object with the information of logits and the loss function.

training: bool

update_memory_logits(t)

Method for updating the memory logits with the exponential average.

Args: t (int): epoch value for bias normalization.

class ssfinetuning.models.TriTrain(o_weight=0.01, num_labels=2, classifier_dropout=0.1, model_name='albert-base-v2', ssl_model_type='CoTrain', supervised_run=False)

Bases: ssfinetuning.models.CoTrain

Implementation of Tri Training(multi task TriTrain) as introduced in <https://arxiv.org/abs/1804.09530>. Note: Here the implementation is at only the fine tuning. The base network is to be pretrained transformer model.

Args:

kwargs: keyword arguments are the same as is for CoTrain class, except model type string and number of models(num_models) as obvious with the name.

forward(**kwargs)

Forward function used during evaluation of trained models. See ~trainer_utils.TrainerForTriTraining and ~transformers.Trainer for more details.

Args:

kwargs: Arguments from pretrained_model.forward.

Return: TriTrainModelOutput object with the information of logits of both models and the loss function.

m3_forward(**kwargs)

Forward function for model 3. See ~trainer_utils.TrainerForTriTraining and ~transformers.Trainer for more details.

Args:

kwargs: Arguments from pretrained_model.forward.

Return: TriTrainModelOutput object with the information of logits of both models and the loss function.

training: bool

class ssfinetuning.models.TriTrainModelOutput(loss: Union[torch.FloatTensor, NoneType] = None, logits_m1: torch.FloatTensor = None, logits_m2: torch.FloatTensor = None, logits_m3: torch.FloatTensor = None)

Bases: transformers.file_utils.ModelOutput

logits_m1: torch.FloatTensor = None

logits_m2: torch.FloatTensor = None

logits_m3: torch.FloatTensor = None

loss: Optional[torch.FloatTensor] = None

ssfinetuning.models.add_signature_from(base)

ssfinetuning.plotting_utils module

ssfinetuning.plotting_utils.add_end_args(from_fn)

ssfinetuning.plotting_utils.get_default_legend_pos(num_graphs, axes_index=None)

Sets the default values where legends could be placed.

Args:

num_graphs (int ): num of num_graphs to be plotted with maximum value of 4.

axes_index (int , optional, defaults to None ): In the case of multiple, setting changed depending on index of axes.

ssfinetuning.plotting_utils.plot_in(axes, axes_index=0, totplots=1, data=None, data_to_compare=None, x_axis_col='epoch', y_axis_col='eval_mc', select_best=5, criteria='max', cols_to_find=['w_ramprate'], dis_col='l_fr', dis_val=False, data_to_compare_lb='sup_stats')

Main plotting function.

Args:

axes (matplotlib.pyplot.axes ): axes object to plot.

axes_index (int, optional, defaults to None ): In the case of multiple, setting changed depending on index of axes.

totplots (obj: int): Total number of plots to be plotted.

data (pd.DataFrame ): Data to sort from.

data_to_compare (pd.DataFrame ): Data to compare with the sorted results. For example, purely supervised results.

x_axis_col (str, optional, defaults to ‘epoch’ ): Column name with the values to be plotted on the x axis.

y_axis_col (str, optional, defaults to ‘eval_mc’ ): Column name with the values to be plotted on the y axis.

select_best (int, optional, defaults to 5 ): The number of plots to be made based out of the sorted list.

criteria (str, optional, defaults to ‘max’ ): Criteria to sort the list. There are three choices, (i) max, (ii) min, and (iii)mean.

cols_to_find (list, optional, defaults to [‘w_ramprate’] ): The list of column names which will analysed to find the best of them based the sorting criteria.

dis_col (str, optional, defaults to ‘l_fr’ ): The dicriminatory column name. This would be column name along which the graphs would be divided along the subplots.

dis_val (int or ‘float’, optional, defaults to ‘None’ ): This is only valid if the ‘dis_col’ is not None. This is used when a certain unique value of discriminatory column is plotted.

data_to_compare_lb (str, optional, defaults to ‘sup_stats’ ): Label name for the data_to_compare plot.

ssfinetuning.plotting_utils.plot_with_discriminator(dis_col, save_png, data=None, *args, **kwargs)

Plotter if discriminatory column is specified.

Args:

dis_col (str ): The dicriminatory column name. This would be column name along which the graphs would be divided along the subplots.

save_png (str ): Whether to save png of results or not. If the value of save_png is not None then it would save the image with name of string value set in save_png.

kwargs: remaining dictionary of keyword arguments from the plot_in function.

Adding Args from function-> plot_in