samdark/sitemap

Change type of access for methods

WinterSilence opened this issue · 34 comments

Change the access type of methods from private to protected to allow extending the classes.

Which concrete methods do you need, and for which use case? Blindly changing all methods to protected is a bad idea.

It's not a bad idea, since all the methods and properties are interdependent. I want to store the data in a cache rather than in files, so at a minimum I need to override Index::write() plus Sitemap::createNewFile() and flush(). But of course these methods work with other methods and properties which are, you guessed it, private...

Alternatively: you could add a new branch and implement a version that returns the content instead of saving it to files.

Describe your use case. Why exactly do you need to store the sitemap in a cache? How do you serve it?

The cache (https://github.com/kohana/cache).
A controller loads the sitemap XML from the cache (or generates it) and serves it to the client. The sitemap doesn't get many requests, so this is much more convenient than a cron job or regenerating it on every change to the list of links.
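A minimal sketch of that controller flow, assuming a simple file-based cache helper (`cachedSitemap` and its parameters are hypothetical names, not part of samdark/sitemap or Kohana's cache module):

```php
<?php
// Hypothetical sketch: serve sitemap XML from a cache, regenerating it
// only when the cached copy has expired. The file-based cache here is a
// stand-in for Kohana's cache module.

function cachedSitemap(callable $generateXml, string $cacheFile, int $ttl): string
{
    clearstatcache(true, $cacheFile);

    // Serve the cached copy while it is still fresh.
    if (is_file($cacheFile) && time() - filemtime($cacheFile) < $ttl) {
        return file_get_contents($cacheFile);
    }

    // Otherwise regenerate and refresh the cache.
    $xml = $generateXml();
    file_put_contents($cacheFile, $xml);

    return $xml;
}

// In a controller action one would then do something like:
// header('Content-Type: application/xml');
// echo cachedSitemap($generateSitemapXml, '/tmp/sitemap.cache', 3600);
```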

You keep the XML in a cache?

I keep the XML text in a file cache and have no problems with it whatsoever. The main idea is to let users decide for themselves how to store the data. I see no point in restricting extensibility like this; a small refactoring could separate content generation from content storage.

PR #16 has been sitting open for a couple of months. A user can't implement compression in their own variant, so they're forced to extend yours, which, as I understand it, you don't want. The result: the library loses its point.

Yes, generation can be separated from storage. But I'm against making everything protected.

Look for yourself at what would remain private if these methods were made protected. Will you do the separation yourself, or should I give it a try?

I've already started working on it, but I'd be happy to look at an alternative solution.

I doubt it will be much of an alternative, but I'll give it a try out of curiosity.

I want to extend the classes and override the write() and flush() methods so that I can have them write to Amazon S3 instead of the local filesystem. I can't do that because the methods are marked private instead of protected. I think getCurrentFilePath() and $this->writer would also need to be protected.

@ejunker I think it's better to abstract writing into a separate interface/class. If you have ideas about it, let's discuss them.
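One possible shape for such an abstraction, sketched here with hypothetical names (`WriterInterface` and `MemoryWriter` are assumptions, not existing library classes):

```php
<?php
// Sketch: hide the writing backend behind an interface so that file,
// in-memory, or S3 targets become interchangeable. Names are hypothetical.

interface WriterInterface
{
    public function append($data);
    public function finish();
}

// In-memory implementation, useful for serving the XML from a response
// instead of writing it to the local filesystem.
class MemoryWriter implements WriterInterface
{
    private $buffer = '';

    public function append($data)
    {
        $this->buffer .= $data;
    }

    public function finish()
    {
        // Nothing to close for an in-memory buffer.
    }

    public function getContents()
    {
        return $this->buffer;
    }
}

// A file implementation would wrap fopen()/fwrite()/fclose(); an S3
// implementation would buffer locally and upload on finish().
```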

@samdark @WinterSilence Any news on this? We have a use case where we just want the XML rendered to the output buffer rather than to a file. Right now there is no way to get a Sitemap instance without passing a file path to it.

  1. Get/set the writer so it can be used with or without a file
  2. If no file is provided, it can assume $this->writer->openURI('php://output');
// Null-object writer: used when no file path is given, so nothing is
// written to disk.
class NullWriter implements WriterInterface
{
    public function append($data)
    {
        // Intentionally a no-op.
    }

    public function finish()
    {
        // Nothing to close.
    }
}

__construct($filePath = null, $useXhtml = false)

if ($filePath !== null) {
    $dir = dirname($filePath);

    if (!is_dir($dir)) {
        throw new \InvalidArgumentException(
            "Please specify a valid file path. Directory does not exist. You have specified: {$dir}."
        );
    }

    $this->filePath = $filePath;
}

$this->useXhtml = $useXhtml;

createNewFile()

if ($this->filePath !== null) {
    $filePath = $this->getCurrentFilePath();

    if ($this->useGzip) {
        if (function_exists('deflate_init') && function_exists('deflate_add')) {
            $this->writerBackend = new DeflateWriter($filePath);
        } else {
            $this->writerBackend = new TempFileGZIPWriter($filePath);
        }
    } else {
        $this->writerBackend = new PlainFileWriter($filePath);
    }
} else {
    $this->writerBackend = new NullWriter();
}

if ($this->writerBackend instanceof NullWriter) {
    $this->writer->openURI('php://output');
} else {
    $this->writer->openMemory();
}
  3. A setIndentString() implementation to control the indentation

That is the general idea. I have seen another issue where a completely new solution is suggested. Any news on whether an implementation is done or in progress?

No implementation was done or is in progress. While it's OK to have the ability to write to any stream, it's not clear what your particular use case is.

@samdark I don't want to use a file to store the XML, I just want to render it to the user. Are you open to accepting a PR with the suggested changes?

That's a very bad idea:

  1. A sitemap doesn't change that often.
  2. You'll get quite high memory usage, considering default output buffering in most frameworks.
  3. You won't be able to meet the standard's limits on the number of URLs and sitemap size.
  4. You won't be able to use multiple sitemaps.
  1. Consider a dynamic site where hundreds of new stories are pushed daily
  2. It will follow the same buffering as now
  3. It will respect the number of URLs allowed, will flush, and will never exceed 50K
  4. When writing to the output URI, no multiple sitemaps are required. If they are, it will fall back to the original file implementation. (It's an extension, not something that dumps the old functionality altogether)
  5. We don't want to manage files on a regular basis (file concurrency issues with CRUD operations)
  6. The stream will be cached, so there is no server load to generate it each time
  1. That's fine. If you regenerate the sitemap on publishing a new story, that means only 200 regenerations. If you generate it dynamically on each request, it means significantly more.
  2. Currently it dumps into a file stream, releasing memory. With output buffering it won't release memory until the buffer is flushed, which, in modern frameworks, happens at the very end of the request-response cycle.
  3. That contradicts "hundreds of new stories pushed daily". You'd reach the limit in at most a year and three months at that publishing rate.
  4. Wrong. A search engine spider doesn't care how you serve the content. It has a hard limit of 50,000 URLs and 10 megabytes of payload. If you pass the limit, it will error out and not accept your sitemap at all.
  1. The writer's flush will handle it by itself; since the stream is the output URI, memory is released as soon as the writer's flush is called
  2. We only get the newly published URLs, not all of them, so 50K still won't be reached
  3. Same as point 3
  1. Yes, the problem is when it's called, but yeah, that could be irrelevant and it's fine not to care about it.
  2. Huh? That doesn't make sense. Search engines, as far as I remember, expect a full sitemap, not a partial one.
  1. It will be a full sitemap with the new URLs. Whenever the URLs are updated, the new URLs will be indexed, as the old ones have already been crawled

I think it's a bad idea to submit only new URLs and omit old ones: https://webmasters.stackexchange.com/questions/2459/should-i-include-everything-in-the-sitemap-or-only-new-content. No search engine guarantees that a page that reached the index will stay there forever.

Got it. But I think that's a separate issue from writing the XML to a URI rather than to the filesystem or memory.
We still need a way to render content to a source URI, right?

Well, it was designed to work with files. For example, it advances to the next file when the current one is full: https://github.com/samdark/sitemap/blob/master/Sitemap.php#L227. I'm not sure how to do that if you've specified a stream that's not a file.
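The rotation being referred to boils down to a per-file URL counter; a simplified sketch of the idea (not the actual implementation, class and method names here are assumptions):

```php
<?php
// Simplified sketch of the file-rotation logic: when the current sitemap
// file reaches its URL limit, a new file is started. With a plain output
// stream there is no equivalent of "the next file", which is the problem
// discussed here. This is not the actual samdark/sitemap code.

class RotatingSitemap
{
    private $urlsCount = 0;
    private $fileIndex = 0;
    public $files = [];

    public function __construct(private int $maxUrls = 50000)
    {
        $this->createNewFile();
    }

    private function createNewFile()
    {
        $this->fileIndex++;
        // The real code opens a writer backend here; we only record the name.
        $this->files[] = "sitemap-{$this->fileIndex}.xml";
        $this->urlsCount = 0;
    }

    public function addItem($url)
    {
        // Advance to the next file once the current one is full.
        if ($this->urlsCount >= $this->maxUrls) {
            $this->createNewFile();
        }
        $this->urlsCount++;
    }
}
```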

Nothing happens; as said earlier, the writer just flushes the stream and carries on with the processing.

  1. $this->writerBackend instanceof NullWriter does nothing
  2. Rather than creating file or memory objects on the fly, one should inject them in __construct()
  3. Depending on the type of injection, the respective createNewFile() or createSourceUri() can be called
  4. Injecting an object would change the code a bit and would not be backward compatible
  5. Creating the object on the fly, as it does at the moment, plus an instanceof check, will do

Carries on with processing where? For files it opens a new file. Continuing to flush to output in this case would result in an incorrect sitemap that won't be parsed by search engines. It doesn't make sense to me.

Same with memory. You'll need a sitemap index and a set of sitemaps, but these are read in multiple requests, so you can't direct them into streams.

It will output the whole stream to the source URI; no new file will be created because of the NullWriter check.

But that basically won't be read once you reach 50,000 elements or 10 MB of data, and you have to write all URLs into the sitemap.

Agreed, but we don't have a use case with 50K URLs or 10 MB of data. If that becomes the case, the fallback is always the original implementation, i.e. the file system.

If you have the filesystem fallback anyway, why do you even need to generate anything at runtime at all?

Right now we don't have any need to use the filesystem when we can manage at runtime. Handling file read/write concurrency is not desired.

I see. Well, sorry, but I don't think providing an easy way to shoot yourself in the foot is a good idea. Therefore, please maintain your own fork.